Overview
Brought to you by YData
Dataset statistics
| Number of variables | 151 |
|---|---|
| Number of observations | 604626 |
| Missing cells | 50336207 |
| Missing cells (%) | 55.1% |
| Total size in memory | 696.6 MiB |
| Average record size in memory | 1.2 KiB |
Variable types
| Text | 151 |
|---|
Dataset
| Description | Entomology NMNH Extant Extant Specimen Records 0052484-241126133413365 |
|---|---|
| URL | https://doi.org/10.15468/dl.ptewed |
license has constant value "CC0_1_0" | Constant |
publisher has constant value "National Museum of Natural History, Smithsonian Institution" | Constant |
institutionID has constant value "urn:lsid:biocol.org:col:34871" | Constant |
collectionID has constant value "urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad" | Constant |
institutionCode has constant value "USNM" | Constant |
collectionCode has constant value "ENT" | Constant |
datasetName has constant value "NMNH Extant Biology" | Constant |
occurrenceStatus has constant value "PRESENT" | Constant |
verbatimLabel has constant value "-11.7815" | Constant |
materialSampleID has constant value "-76.7017" | Constant |
verbatimDepth has constant value "220m inside cave entrance" | Constant |
verbatimCoordinateSystem has constant value "Degrees Minutes Seconds" | Constant |
verbatimSRS has constant value "1973-05-08" | Constant |
footprintSRS has constant value "128" | Constant |
footprintSpatialFit has constant value "128" | Constant |
georeferencedDate has constant value "5" | Constant |
earliestEraOrLowestErathem has constant value "Animalia" | Constant |
latestEraOrHighestErathem has constant value "Arthropoda" | Constant |
earliestPeriodOrLowestSystem has constant value "Insecta" | Constant |
group has constant value "Florida" | Constant |
formation has constant value "Pinellas" | Constant |
verbatimIdentification has constant value "SPECIES" | Constant |
identifiedByID has constant value "ACCEPTED" | Constant |
taxonConceptID has constant value "StillImage" | Constant |
acceptedNameUsage has constant value "false" | Constant |
nameAccordingTo has constant value "1" | Constant |
namePublishedIn has constant value "54" | Constant |
namePublishedInYear has constant value "216" | Constant |
subtribe has constant value "EML" | Constant |
subgenus has constant value "true" | Constant |
verbatimTaxonRank has constant value "PER" | Constant |
nomenclaturalCode has constant value "PER.16_1" | Constant |
nomenclaturalStatus has constant value "PER.16.6_1" | Constant |
taxonRemarks has constant value "Huarochiri" | Constant |
subgenusKey has constant value "Insecta" | Constant |
protocol has constant value "EML" | Constant |
projectId has constant value "roseni" | Constant |
isSequenced has constant value "false" | Constant |
catalogNumber has 233418 (38.6%) missing values | Missing |
recordNumber has 604589 (> 99.9%) missing values | Missing |
recordedBy has 203336 (33.6%) missing values | Missing |
sex has 384462 (63.6%) missing values | Missing |
lifeStage has 184129 (30.5%) missing values | Missing |
preparations has 42051 (7.0%) missing values | Missing |
occurrenceRemarks has 459276 (76.0%) missing values | Missing |
verbatimLabel has 604625 (> 99.9%) missing values | Missing |
materialSampleID has 604625 (> 99.9%) missing values | Missing |
fieldNumber has 600377 (99.3%) missing values | Missing |
eventDate has 239769 (39.7%) missing values | Missing |
startDayOfYear has 270965 (44.8%) missing values | Missing |
endDayOfYear has 270965 (44.8%) missing values | Missing |
year has 240229 (39.7%) missing values | Missing |
month has 254573 (42.1%) missing values | Missing |
day has 314935 (52.1%) missing values | Missing |
verbatimEventDate has 396306 (65.5%) missing values | Missing |
habitat has 604427 (> 99.9%) missing values | Missing |
locationID has 603581 (99.8%) missing values | Missing |
higherGeography has 156072 (25.8%) missing values | Missing |
continent has 199137 (32.9%) missing values | Missing |
islandGroup has 602107 (99.6%) missing values | Missing |
island has 595261 (98.5%) missing values | Missing |
countryCode has 163440 (27.0%) missing values | Missing |
stateProvince has 173217 (28.6%) missing values | Missing |
county has 254826 (42.1%) missing values | Missing |
locality has 158340 (26.2%) missing values | Missing |
verbatimElevation has 594692 (98.4%) missing values | Missing |
verbatimDepth has 604620 (> 99.9%) missing values | Missing |
minimumDistanceAboveSurfaceInMeters has 604624 (> 99.9%) missing values | Missing |
decimalLatitude has 285575 (47.2%) missing values | Missing |
decimalLongitude has 285575 (47.2%) missing values | Missing |
coordinateUncertaintyInMeters has 592674 (98.0%) missing values | Missing |
pointRadiusSpatialFit has 604624 (> 99.9%) missing values | Missing |
verbatimCoordinateSystem has 604625 (> 99.9%) missing values | Missing |
verbatimSRS has 604625 (> 99.9%) missing values | Missing |
footprintSRS has 604625 (> 99.9%) missing values | Missing |
footprintSpatialFit has 604625 (> 99.9%) missing values | Missing |
georeferencedBy has 604623 (> 99.9%) missing values | Missing |
georeferencedDate has 604625 (> 99.9%) missing values | Missing |
georeferenceProtocol has 366755 (60.7%) missing values | Missing |
georeferenceSources has 604624 (> 99.9%) missing values | Missing |
georeferenceRemarks has 596178 (98.6%) missing values | Missing |
latestEonOrHighestEonothem has 604624 (> 99.9%) missing values | Missing |
earliestEraOrLowestErathem has 604624 (> 99.9%) missing values | Missing |
latestEraOrHighestErathem has 604624 (> 99.9%) missing values | Missing |
earliestPeriodOrLowestSystem has 604624 (> 99.9%) missing values | Missing |
latestPeriodOrHighestSystem has 604624 (> 99.9%) missing values | Missing |
latestEpochOrHighestSeries has 604622 (> 99.9%) missing values | Missing |
earliestAgeOrLowestStage has 604624 (> 99.9%) missing values | Missing |
highestBiostratigraphicZone has 604624 (> 99.9%) missing values | Missing |
lithostratigraphicTerms has 604622 (> 99.9%) missing values | Missing |
group has 604625 (> 99.9%) missing values | Missing |
formation has 604625 (> 99.9%) missing values | Missing |
member has 604624 (> 99.9%) missing values | Missing |
bed has 604624 (> 99.9%) missing values | Missing |
verbatimIdentification has 604624 (> 99.9%) missing values | Missing |
identificationQualifier has 603189 (99.8%) missing values | Missing |
typeStatus has 486591 (80.5%) missing values | Missing |
identifiedBy has 454955 (75.2%) missing values | Missing |
identifiedByID has 604624 (> 99.9%) missing values | Missing |
identificationVerificationStatus has 604622 (> 99.9%) missing values | Missing |
identificationRemarks has 604622 (> 99.9%) missing values | Missing |
taxonID has 604624 (> 99.9%) missing values | Missing |
namePublishedInID has 604624 (> 99.9%) missing values | Missing |
taxonConceptID has 604625 (> 99.9%) missing values | Missing |
acceptedNameUsage has 604624 (> 99.9%) missing values | Missing |
parentNameUsage has 604623 (> 99.9%) missing values | Missing |
originalNameUsage has 604624 (> 99.9%) missing values | Missing |
nameAccordingTo has 604624 (> 99.9%) missing values | Missing |
namePublishedIn has 604624 (> 99.9%) missing values | Missing |
namePublishedInYear has 604624 (> 99.9%) missing values | Missing |
superfamily has 604624 (> 99.9%) missing values | Missing |
family has 11642 (1.9%) missing values | Missing |
subfamily has 604624 (> 99.9%) missing values | Missing |
subtribe has 604624 (> 99.9%) missing values | Missing |
genus has 19883 (3.3%) missing values | Missing |
genericName has 19882 (3.3%) missing values | Missing |
subgenus has 604624 (> 99.9%) missing values | Missing |
specificEpithet has 109508 (18.1%) missing values | Missing |
infraspecificEpithet has 586367 (97.0%) missing values | Missing |
cultivarEpithet has 604624 (> 99.9%) missing values | Missing |
verbatimTaxonRank has 604625 (> 99.9%) missing values | Missing |
vernacularName has 604624 (> 99.9%) missing values | Missing |
nomenclaturalCode has 604625 (> 99.9%) missing values | Missing |
nomenclaturalStatus has 604625 (> 99.9%) missing values | Missing |
taxonRemarks has 604625 (> 99.9%) missing values | Missing |
elevation has 557870 (92.3%) missing values | Missing |
elevationAccuracy has 573282 (94.8%) missing values | Missing |
depth has 604592 (> 99.9%) missing values | Missing |
depthAccuracy has 604615 (> 99.9%) missing values | Missing |
distanceFromCentroidInMeters has 601631 (99.5%) missing values | Missing |
mediaType has 369838 (61.2%) missing values | Missing |
familyKey has 11642 (1.9%) missing values | Missing |
genusKey has 19883 (3.3%) missing values | Missing |
subgenusKey has 604624 (> 99.9%) missing values | Missing |
speciesKey has 109501 (18.1%) missing values | Missing |
species has 109503 (18.1%) missing values | Missing |
repatriated has 162658 (26.9%) missing values | Missing |
projectId has 604625 (> 99.9%) missing values | Missing |
gbifRegion has 163113 (27.0%) missing values | Missing |
level0Gid has 288722 (47.8%) missing values | Missing |
level0Name has 288722 (47.8%) missing values | Missing |
level1Gid has 288806 (47.8%) missing values | Missing |
level1Name has 288804 (47.8%) missing values | Missing |
level2Gid has 297499 (49.2%) missing values | Missing |
level2Name has 297510 (49.2%) missing values | Missing |
level3Gid has 540301 (89.4%) missing values | Missing |
level3Name has 541181 (89.5%) missing values | Missing |
iucnRedListCategory has 96088 (15.9%) missing values | Missing |
gbifID has unique values | Unique |
occurrenceID has unique values | Unique |
Reproduction
| Analysis started | 2025-01-08 22:47:20.970730 |
|---|---|
| Analysis finished | 2025-01-08 22:47:55.454125 |
| Duration | 34.48 seconds |
| Software version | ydata-profiling vv4.12.1 |
| Download configuration | config.json |
Variables
gbifID
Text
Unique 
| Distinct | 604626 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 604626 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1321729650 |
|---|---|
| 2nd row | 1320180785 |
| 3rd row | 4403931423 |
| 4th row | 1320185860 |
| 5th row | 1320185980 |
| Value | Count | Frequency (%) |
| 1321729650 | 1 | < 0.1% |
| 1321751610 | 1 | < 0.1% |
| 1828939237 | 1 | < 0.1% |
| 1321753851 | 1 | < 0.1% |
| 4403917418 | 1 | < 0.1% |
| 1321742115 | 1 | < 0.1% |
| 4403931423 | 1 | < 0.1% |
| 1320185860 | 1 | < 0.1% |
| 1320185980 | 1 | < 0.1% |
| 2236094411 | 1 | < 0.1% |
| Other values (604616) | 604616 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1132843 | |
| 3 | 860404 | |
| 2 | 781753 | |
| 0 | 530599 | |
| 8 | 513679 | |
| 9 | 488164 | |
| 7 | 473950 | |
| 4 | 451737 | 7.5% |
| 5 | 410650 | 6.8% |
| 6 | 402481 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6046260 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1132843 | |
| 3 | 860404 | |
| 2 | 781753 | |
| 0 | 530599 | |
| 8 | 513679 | |
| 9 | 488164 | |
| 7 | 473950 | |
| 4 | 451737 | 7.5% |
| 5 | 410650 | 6.8% |
| 6 | 402481 | 6.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6046260 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1132843 | |
| 3 | 860404 | |
| 2 | 781753 | |
| 0 | 530599 | |
| 8 | 513679 | |
| 9 | 488164 | |
| 7 | 473950 | |
| 4 | 451737 | 7.5% |
| 5 | 410650 | 6.8% |
| 6 | 402481 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6046260 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1132843 | |
| 3 | 860404 | |
| 2 | 781753 | |
| 0 | 530599 | |
| 8 | 513679 | |
| 9 | 488164 | |
| 7 | 473950 | |
| 4 | 451737 | 7.5% |
| 5 | 410650 | 6.8% |
| 6 | 402481 | 6.7% |
license
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CC0_1_0 |
|---|---|
| 2nd row | CC0_1_0 |
| 3rd row | CC0_1_0 |
| 4th row | CC0_1_0 |
| 5th row | CC0_1_0 |
| Value | Count | Frequency (%) |
| cc0_1_0 | 604626 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 1209252 | |
| 0 | 1209252 | |
| _ | 1209252 | |
| 1 | 604626 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1813878 | |
| Uppercase Letter | 1209252 | |
| Connector Punctuation | 1209252 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1209252 | |
| 1 | 604626 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1209252 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1209252 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3023130 | |
| Latin | 1209252 | 28.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1209252 | |
| _ | 1209252 | |
| 1 | 604626 |
Latin
| Value | Count | Frequency (%) |
| C | 1209252 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4232382 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 1209252 | |
| 0 | 1209252 | |
| _ | 1209252 | |
| 1 | 604626 |
modified
Text
| Distinct | 56588 |
|---|---|
| Distinct (%) | 9.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 20 |
| Min length | 20 |
Unique
| Unique | 30778 ? |
|---|---|
| Unique (%) | 5.1% |
Sample
| 1st row | 2013-09-16T11:56:00Z |
|---|---|
| 2nd row | 2016-06-09T14:33:00Z |
| 3rd row | 2023-08-23T09:36:00Z |
| 4th row | 2023-05-19T10:32:00Z |
| 5th row | 2015-10-05T15:58:00Z |
| Value | Count | Frequency (%) |
| 2017-04-17t11:48:00z | 9681 | 1.6% |
| 2017-04-17t11:49:00z | 9420 | 1.6% |
| 2017-04-17t11:50:00z | 8719 | 1.4% |
| 2017-04-17t11:47:00z | 8654 | 1.4% |
| 2017-04-17t11:46:00z | 6000 | 1.0% |
| 2021-08-23t15:49:00z | 3095 | 0.5% |
| 2021-08-23t15:48:00z | 3057 | 0.5% |
| 2016-07-27t14:05:00z | 3041 | 0.5% |
| 2016-07-27t14:06:00z | 1844 | 0.3% |
| 2021-08-23t15:50:00z | 1737 | 0.3% |
| Other values (56578) | 549378 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2927853 | |
| 1 | 1566143 | |
| 2 | 1372338 | |
| - | 1209252 | |
| : | 1209252 | |
| T | 604626 | 5.0% |
| Z | 604626 | 5.0% |
| 3 | 593038 | 4.9% |
| 5 | 494587 | 4.1% |
| 4 | 456514 | 3.8% |
| Other values (4) | 1054291 | 8.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8464764 | |
| Dash Punctuation | 1209252 | 10.0% |
| Other Punctuation | 1209252 | 10.0% |
| Uppercase Letter | 1209252 | 10.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2927853 | |
| 1 | 1566143 | |
| 2 | 1372338 | |
| 3 | 593038 | 7.0% |
| 5 | 494587 | 5.8% |
| 4 | 456514 | 5.4% |
| 9 | 314284 | 3.7% |
| 7 | 310920 | 3.7% |
| 6 | 238170 | 2.8% |
| 8 | 190917 | 2.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 604626 | |
| Z | 604626 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1209252 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1209252 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10883268 | |
| Latin | 1209252 | 10.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2927853 | |
| 1 | 1566143 | |
| 2 | 1372338 | |
| - | 1209252 | |
| : | 1209252 | |
| 3 | 593038 | 5.4% |
| 5 | 494587 | 4.5% |
| 4 | 456514 | 4.2% |
| 9 | 314284 | 2.9% |
| 7 | 310920 | 2.9% |
| Other values (2) | 429087 | 3.9% |
Latin
| Value | Count | Frequency (%) |
| T | 604626 | |
| Z | 604626 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12092520 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2927853 | |
| 1 | 1566143 | |
| 2 | 1372338 | |
| - | 1209252 | |
| : | 1209252 | |
| T | 604626 | 5.0% |
| Z | 604626 | 5.0% |
| 3 | 593038 | 4.9% |
| 5 | 494587 | 4.1% |
| 4 | 456514 | 3.8% |
| Other values (4) | 1054291 | 8.7% |
publisher
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 59 |
|---|---|
| Median length | 59 |
| Mean length | 59 |
| Min length | 59 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | National Museum of Natural History, Smithsonian Institution |
|---|---|
| 2nd row | National Museum of Natural History, Smithsonian Institution |
| 3rd row | National Museum of Natural History, Smithsonian Institution |
| 4th row | National Museum of Natural History, Smithsonian Institution |
| 5th row | National Museum of Natural History, Smithsonian Institution |
| Value | Count | Frequency (%) |
| national | 604626 | |
| museum | 604626 | |
| of | 604626 | |
| natural | 604626 | |
| history | 604626 | |
| smithsonian | 604626 | |
| institution | 604626 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 4232382 | |
| i | 3627756 | |
| 3627756 | ||
| a | 3023130 | 8.5% |
| o | 3023130 | 8.5% |
| n | 3023130 | 8.5% |
| s | 2418504 | 6.8% |
| u | 2418504 | 6.8% |
| r | 1209252 | 3.4% |
| m | 1209252 | 3.4% |
| Other values (11) | 7860138 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 27812796 | |
| Space Separator | 3627756 | 10.2% |
| Uppercase Letter | 3627756 | 10.2% |
| Other Punctuation | 604626 | 1.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 4232382 | |
| i | 3627756 | |
| a | 3023130 | |
| o | 3023130 | |
| n | 3023130 | |
| s | 2418504 | |
| u | 2418504 | |
| r | 1209252 | 4.3% |
| m | 1209252 | 4.3% |
| l | 1209252 | 4.3% |
| Other values (4) | 2418504 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1209252 | |
| M | 604626 | |
| H | 604626 | |
| S | 604626 | |
| I | 604626 |
Space Separator
| Value | Count | Frequency (%) |
| 3627756 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 604626 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 31440552 | |
| Common | 4232382 | 11.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 4232382 | |
| i | 3627756 | |
| a | 3023130 | |
| o | 3023130 | |
| n | 3023130 | |
| s | 2418504 | 7.7% |
| u | 2418504 | 7.7% |
| r | 1209252 | 3.8% |
| m | 1209252 | 3.8% |
| N | 1209252 | 3.8% |
| Other values (9) | 6046260 |
Common
| Value | Count | Frequency (%) |
| 3627756 | ||
| , | 604626 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 35672934 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 4232382 | |
| i | 3627756 | |
| 3627756 | ||
| a | 3023130 | 8.5% |
| o | 3023130 | 8.5% |
| n | 3023130 | 8.5% |
| s | 2418504 | 6.8% |
| u | 2418504 | 6.8% |
| r | 1209252 | 3.4% |
| m | 1209252 | 3.4% |
| Other values (11) | 7860138 |
institutionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 29 |
| Mean length | 29 |
| Min length | 29 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:lsid:biocol.org:col:34871 |
|---|---|
| 2nd row | urn:lsid:biocol.org:col:34871 |
| 3rd row | urn:lsid:biocol.org:col:34871 |
| 4th row | urn:lsid:biocol.org:col:34871 |
| 5th row | urn:lsid:biocol.org:col:34871 |
| Value | Count | Frequency (%) |
| urn:lsid:biocol.org:col:34871 | 604626 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2418504 | |
| : | 2418504 | |
| l | 1813878 | 10.3% |
| i | 1209252 | 6.9% |
| r | 1209252 | 6.9% |
| c | 1209252 | 6.9% |
| g | 604626 | 3.4% |
| 7 | 604626 | 3.4% |
| 8 | 604626 | 3.4% |
| 4 | 604626 | 3.4% |
| Other values (8) | 4837008 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11487894 | |
| Other Punctuation | 3023130 | 17.2% |
| Decimal Number | 3023130 | 17.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 2418504 | |
| l | 1813878 | |
| i | 1209252 | |
| r | 1209252 | |
| c | 1209252 | |
| g | 604626 | 5.3% |
| u | 604626 | 5.3% |
| b | 604626 | 5.3% |
| d | 604626 | 5.3% |
| s | 604626 | 5.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 604626 | |
| 8 | 604626 | |
| 4 | 604626 | |
| 3 | 604626 | |
| 1 | 604626 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 2418504 | |
| . | 604626 | 20.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11487894 | |
| Common | 6046260 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 2418504 | |
| l | 1813878 | |
| i | 1209252 | |
| r | 1209252 | |
| c | 1209252 | |
| g | 604626 | 5.3% |
| u | 604626 | 5.3% |
| b | 604626 | 5.3% |
| d | 604626 | 5.3% |
| s | 604626 | 5.3% |
Common
| Value | Count | Frequency (%) |
| : | 2418504 | |
| 7 | 604626 | 10.0% |
| 8 | 604626 | 10.0% |
| 4 | 604626 | 10.0% |
| 3 | 604626 | 10.0% |
| . | 604626 | 10.0% |
| 1 | 604626 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17534154 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 2418504 | |
| : | 2418504 | |
| l | 1813878 | 10.3% |
| i | 1209252 | 6.9% |
| r | 1209252 | 6.9% |
| c | 1209252 | 6.9% |
| g | 604626 | 3.4% |
| 7 | 604626 | 3.4% |
| 8 | 604626 | 3.4% |
| 4 | 604626 | 3.4% |
| Other values (8) | 4837008 |
collectionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 45 |
| Mean length | 45 |
| Min length | 45 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad |
|---|---|
| 2nd row | urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad |
| 3rd row | urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad |
| 4th row | urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad |
| 5th row | urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad |
| Value | Count | Frequency (%) |
| urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad | 604626 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3023130 | 11.1% |
| a | 2418504 | 8.9% |
| - | 2418504 | 8.9% |
| d | 1813878 | 6.7% |
| c | 1813878 | 6.7% |
| u | 1813878 | 6.7% |
| 8 | 1209252 | 4.4% |
| 3 | 1209252 | 4.4% |
| : | 1209252 | 4.4% |
| 9 | 1209252 | 4.4% |
| Other values (12) | 9069390 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12092520 | |
| Decimal Number | 11487894 | |
| Dash Punctuation | 2418504 | 8.9% |
| Other Punctuation | 1209252 | 4.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3023130 | |
| 8 | 1209252 | 10.5% |
| 3 | 1209252 | 10.5% |
| 9 | 1209252 | 10.5% |
| 6 | 1209252 | 10.5% |
| 2 | 1209252 | 10.5% |
| 1 | 604626 | 5.3% |
| 4 | 604626 | 5.3% |
| 7 | 604626 | 5.3% |
| 5 | 604626 | 5.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2418504 | |
| d | 1813878 | |
| c | 1813878 | |
| u | 1813878 | |
| b | 1209252 | |
| e | 604626 | 5.0% |
| i | 604626 | 5.0% |
| r | 604626 | 5.0% |
| n | 604626 | 5.0% |
| f | 604626 | 5.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2418504 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1209252 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15115650 | |
| Latin | 12092520 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3023130 | |
| - | 2418504 | |
| 8 | 1209252 | 8.0% |
| 3 | 1209252 | 8.0% |
| : | 1209252 | 8.0% |
| 9 | 1209252 | 8.0% |
| 6 | 1209252 | 8.0% |
| 2 | 1209252 | 8.0% |
| 1 | 604626 | 4.0% |
| 4 | 604626 | 4.0% |
| Other values (2) | 1209252 | 8.0% |
Latin
| Value | Count | Frequency (%) |
| a | 2418504 | |
| d | 1813878 | |
| c | 1813878 | |
| u | 1813878 | |
| b | 1209252 | |
| e | 604626 | 5.0% |
| i | 604626 | 5.0% |
| r | 604626 | 5.0% |
| n | 604626 | 5.0% |
| f | 604626 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 27208170 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3023130 | 11.1% |
| a | 2418504 | 8.9% |
| - | 2418504 | 8.9% |
| d | 1813878 | 6.7% |
| c | 1813878 | 6.7% |
| u | 1813878 | 6.7% |
| 8 | 1209252 | 4.4% |
| 3 | 1209252 | 4.4% |
| : | 1209252 | 4.4% |
| 9 | 1209252 | 4.4% |
| Other values (12) | 9069390 |
institutionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | USNM |
|---|---|
| 2nd row | USNM |
| 3rd row | USNM |
| 4th row | USNM |
| 5th row | USNM |
| Value | Count | Frequency (%) |
| usnm | 604626 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 604626 | |
| S | 604626 | |
| N | 604626 | |
| M | 604626 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2418504 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 604626 | |
| S | 604626 | |
| N | 604626 | |
| M | 604626 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2418504 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 604626 | |
| S | 604626 | |
| N | 604626 | |
| M | 604626 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2418504 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 604626 | |
| S | 604626 | |
| N | 604626 | |
| M | 604626 |
collectionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ENT |
|---|---|
| 2nd row | ENT |
| 3rd row | ENT |
| 4th row | ENT |
| 5th row | ENT |
| Value | Count | Frequency (%) |
| ent | 604626 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 604626 | |
| N | 604626 | |
| T | 604626 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1813878 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 604626 | |
| N | 604626 | |
| T | 604626 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1813878 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 604626 | |
| N | 604626 | |
| T | 604626 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1813878 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 604626 | |
| N | 604626 | |
| T | 604626 |
datasetName
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NMNH Extant Biology |
|---|---|
| 2nd row | NMNH Extant Biology |
| 3rd row | NMNH Extant Biology |
| 4th row | NMNH Extant Biology |
| 5th row | NMNH Extant Biology |
| Value | Count | Frequency (%) |
| nmnh | 604626 | |
| extant | 604626 | |
| biology | 604626 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 1209252 | 10.5% |
| 1209252 | 10.5% | |
| t | 1209252 | 10.5% |
| o | 1209252 | 10.5% |
| M | 604626 | 5.3% |
| H | 604626 | 5.3% |
| E | 604626 | 5.3% |
| x | 604626 | 5.3% |
| a | 604626 | 5.3% |
| n | 604626 | 5.3% |
| Other values (5) | 3023130 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6650886 | |
| Uppercase Letter | 3627756 | |
| Space Separator | 1209252 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1209252 | |
| o | 1209252 | |
| x | 604626 | |
| a | 604626 | |
| n | 604626 | |
| i | 604626 | |
| l | 604626 | |
| g | 604626 | |
| y | 604626 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1209252 | |
| M | 604626 | |
| H | 604626 | |
| E | 604626 | |
| B | 604626 |
Space Separator
| Value | Count | Frequency (%) |
| 1209252 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10278642 | |
| Common | 1209252 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 1209252 | |
| t | 1209252 | |
| o | 1209252 | |
| M | 604626 | 5.9% |
| H | 604626 | 5.9% |
| E | 604626 | 5.9% |
| x | 604626 | 5.9% |
| a | 604626 | 5.9% |
| n | 604626 | 5.9% |
| B | 604626 | 5.9% |
| Other values (4) | 2418504 |
Common
| Value | Count | Frequency (%) |
| 1209252 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11487894 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 1209252 | 10.5% |
| 1209252 | 10.5% | |
| t | 1209252 | 10.5% |
| o | 1209252 | 10.5% |
| M | 604626 | 5.3% |
| H | 604626 | 5.3% |
| E | 604626 | 5.3% |
| x | 604626 | 5.3% |
| a | 604626 | 5.3% |
| n | 604626 | 5.3% |
| Other values (5) | 3023130 |
basisOfRecord
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 17.99374986 |
| Min length | 17 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESERVED_SPECIMEN |
|---|---|
| 2nd row | PRESERVED_SPECIMEN |
| 3rd row | PRESERVED_SPECIMEN |
| 4th row | PRESERVED_SPECIMEN |
| 5th row | PRESERVED_SPECIMEN |
| Value | Count | Frequency (%) |
| preserved_specimen | 600847 | |
| human_observation | 3779 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 3008014 | |
| R | 1205473 | |
| S | 1205473 | |
| P | 1201694 | 11.0% |
| N | 608405 | 5.6% |
| M | 604626 | 5.6% |
| I | 604626 | 5.6% |
| _ | 604626 | 5.6% |
| V | 604626 | 5.6% |
| C | 600847 | 5.5% |
| Other values (7) | 631079 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 10274863 | |
| Connector Punctuation | 604626 | 5.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 3008014 | |
| R | 1205473 | |
| S | 1205473 | |
| P | 1201694 | 11.7% |
| N | 608405 | 5.9% |
| M | 604626 | 5.9% |
| I | 604626 | 5.9% |
| V | 604626 | 5.9% |
| C | 600847 | 5.8% |
| D | 600847 | 5.8% |
| Other values (6) | 30232 | 0.3% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 604626 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10274863 | |
| Common | 604626 | 5.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 3008014 | |
| R | 1205473 | |
| S | 1205473 | |
| P | 1201694 | 11.7% |
| N | 608405 | 5.9% |
| M | 604626 | 5.9% |
| I | 604626 | 5.9% |
| V | 604626 | 5.9% |
| C | 600847 | 5.8% |
| D | 600847 | 5.8% |
| Other values (6) | 30232 | 0.3% |
Common
| Value | Count | Frequency (%) |
| _ | 604626 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10879489 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 3008014 | |
| R | 1205473 | |
| S | 1205473 | |
| P | 1201694 | 11.0% |
| N | 608405 | 5.6% |
| M | 604626 | 5.6% |
| I | 604626 | 5.6% |
| _ | 604626 | 5.6% |
| V | 604626 | 5.6% |
| C | 600847 | 5.5% |
| Other values (7) | 631079 | 5.8% |
occurrenceID
Text
Unique 
| Distinct | 604626 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 63 |
|---|---|
| Median length | 63 |
| Mean length | 63 |
| Min length | 63 |
Unique
| Unique | 604626 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | http://n2t.net/ark:/65665/3c83a10d1-1e59-4b08-af5b-28d12d2d0c80 |
|---|---|
| 2nd row | http://n2t.net/ark:/65665/383bb510d-d5ce-4c09-b4c4-bc1482fbaf28 |
| 3rd row | http://n2t.net/ark:/65665/383f13aa6-a5b6-40bc-bddc-b42c557aebfc |
| 4th row | http://n2t.net/ark:/65665/383f4d560-c2d2-485c-906c-b6dad303fd7a |
| 5th row | http://n2t.net/ark:/65665/383f634da-bb58-423c-85f4-a267b04c5ee5 |
| Value | Count | Frequency (%) |
| http://n2t.net/ark:/65665/3c83a10d1-1e59-4b08-af5b-28d12d2d0c80 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c932a059-56b2-4846-9e97-741d7bdde29c | 1 | < 0.1% |
| http://n2t.net/ark:/65665/384cb9f0c-76d8-41b2-9a2e-351c10a4ab3f | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c94d744a-d127-4564-9b0c-5d349a138dd0 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/384c3715b-7768-468a-b76b-a68ff7a554d0 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c8c6462b-a9e9-4efa-9205-6fb4e5ef4e65 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/383f13aa6-a5b6-40bc-bddc-b42c557aebfc | 1 | < 0.1% |
| http://n2t.net/ark:/65665/383f4d560-c2d2-485c-906c-b6dad303fd7a | 1 | < 0.1% |
| http://n2t.net/ark:/65665/383f634da-bb58-423c-85f4-a267b04c5ee5 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c898aee2-d463-49d7-ad9c-6fd423e170e1 | 1 | < 0.1% |
| Other values (604616) | 604616 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 3023130 | 7.9% |
| 6 | 2949272 | 7.7% |
| - | 2418504 | 6.3% |
| t | 2418504 | 6.3% |
| 5 | 2343156 | 6.2% |
| a | 1889243 | 5.0% |
| 2 | 1738952 | 4.6% |
| e | 1738278 | 4.6% |
| 3 | 1737371 | 4.6% |
| 4 | 1737249 | 4.6% |
| Other values (16) | 16097779 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16478170 | |
| Lowercase Letter | 14357756 | |
| Other Punctuation | 4837008 | 12.7% |
| Dash Punctuation | 2418504 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 2418504 | |
| a | 1889243 | |
| e | 1738278 | |
| b | 1285981 | |
| n | 1209252 | |
| d | 1134303 | |
| c | 1132879 | |
| f | 1130812 | |
| k | 604626 | 4.2% |
| r | 604626 | 4.2% |
| Other values (2) | 1209252 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 2949272 | |
| 5 | 2343156 | |
| 2 | 1738952 | |
| 3 | 1737371 | |
| 4 | 1737249 | |
| 8 | 1286190 | |
| 9 | 1284662 | |
| 0 | 1134061 | 6.9% |
| 1 | 1133861 | 6.9% |
| 7 | 1133396 | 6.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 3023130 | |
| : | 1209252 | 25.0% |
| . | 604626 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2418504 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 23733682 | |
| Latin | 14357756 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 3023130 | |
| 6 | 2949272 | |
| - | 2418504 | |
| 5 | 2343156 | |
| 2 | 1738952 | |
| 3 | 1737371 | |
| 4 | 1737249 | |
| 8 | 1286190 | 5.4% |
| 9 | 1284662 | 5.4% |
| : | 1209252 | 5.1% |
| Other values (4) | 4005944 |
Latin
| Value | Count | Frequency (%) |
| t | 2418504 | |
| a | 1889243 | |
| e | 1738278 | |
| b | 1285981 | |
| n | 1209252 | |
| d | 1134303 | |
| c | 1132879 | |
| f | 1130812 | |
| k | 604626 | 4.2% |
| r | 604626 | 4.2% |
| Other values (2) | 1209252 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38091438 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 3023130 | 7.9% |
| 6 | 2949272 | 7.7% |
| - | 2418504 | 6.3% |
| t | 2418504 | 6.3% |
| 5 | 2343156 | 6.2% |
| a | 1889243 | 5.0% |
| 2 | 1738952 | 4.6% |
| e | 1738278 | 4.6% |
| 3 | 1737371 | 4.6% |
| 4 | 1737249 | 4.6% |
| Other values (16) | 16097779 |
catalogNumber
Text
Missing 
| Distinct | 371195 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 233418 |
| Missing (%) | 38.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 15 |
| Mean length | 15.03873031 |
| Min length | 12 |
Unique
| Unique | 371182 ? |
|---|---|
| Unique (%) | > 99.9% |
Sample
| 1st row | USNMENT00831303 |
|---|---|
| 2nd row | USNMENT00356408 |
| 3rd row | USNMENT01436172 |
| 4th row | USNMENT00357025 |
| 5th row | USNMENT00314717 |
| Value | Count | Frequency (%) |
| usnment00937212 | 2 | < 0.1% |
| usnment01200936 | 2 | < 0.1% |
| usnment00385731 | 2 | < 0.1% |
| usnment00937219 | 2 | < 0.1% |
| usnment00935890 | 2 | < 0.1% |
| usnment00533165 | 2 | < 0.1% |
| usnment00377587 | 2 | < 0.1% |
| usnment00937222 | 2 | < 0.1% |
| usnment00937214 | 2 | < 0.1% |
| usnment00381323 | 2 | < 0.1% |
| Other values (371185) | 371188 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 804605 | |
| N | 741758 | |
| 1 | 376970 | 6.8% |
| S | 371208 | 6.6% |
| U | 371164 | 6.6% |
| M | 371164 | 6.6% |
| E | 370588 | 6.6% |
| T | 370588 | 6.6% |
| 3 | 302793 | 5.4% |
| 4 | 225899 | 4.0% |
| Other values (11) | 1275760 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2981860 | |
| Uppercase Letter | 2596558 | |
| Other Punctuation | 4077 | 0.1% |
| Lowercase Letter | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 804605 | |
| 1 | 376970 | |
| 3 | 302793 | 10.2% |
| 4 | 225899 | 7.6% |
| 2 | 225441 | 7.6% |
| 5 | 215950 | 7.2% |
| 8 | 215550 | 7.2% |
| 7 | 210801 | 7.1% |
| 6 | 202403 | 6.8% |
| 9 | 201448 | 6.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 741758 | |
| S | 371208 | |
| U | 371164 | |
| M | 371164 | |
| E | 370588 | |
| T | 370588 | |
| C | 44 | < 0.1% |
| A | 44 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| b | 1 | |
| a | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4077 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2985937 | |
| Latin | 2596560 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 804605 | |
| 1 | 376970 | |
| 3 | 302793 | 10.1% |
| 4 | 225899 | 7.6% |
| 2 | 225441 | 7.6% |
| 5 | 215950 | 7.2% |
| 8 | 215550 | 7.2% |
| 7 | 210801 | 7.1% |
| 6 | 202403 | 6.8% |
| 9 | 201448 | 6.7% |
Latin
| Value | Count | Frequency (%) |
| N | 741758 | |
| S | 371208 | |
| U | 371164 | |
| M | 371164 | |
| E | 370588 | |
| T | 370588 | |
| C | 44 | < 0.1% |
| A | 44 | < 0.1% |
| b | 1 | < 0.1% |
| a | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5582497 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 804605 | |
| N | 741758 | |
| 1 | 376970 | 6.8% |
| S | 371208 | 6.6% |
| U | 371164 | 6.6% |
| M | 371164 | 6.6% |
| E | 370588 | 6.6% |
| T | 370588 | 6.6% |
| 3 | 302793 | 5.4% |
| 4 | 225899 | 4.0% |
| Other values (11) | 1275760 |
recordNumber
Text
Missing 
| Distinct | 33 |
|---|---|
| Distinct (%) | 89.2% |
| Missing | 604589 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 43 |
|---|---|
| Median length | 26 |
| Mean length | 17.18918919 |
| Min length | 4 |
Unique
| Unique | 32 ? |
|---|---|
| Unique (%) | 86.5% |
Sample
| 1st row | Collection number "14,957" |
|---|---|
| 2nd row | Lot 607, Sub 182 |
| 3rd row | 4012 |
| 4th row | Dognin Collection |
| 5th row | 12.097 |
| Value | Count | Frequency (%) |
| collection | 10 | 10.0% |
| no | 9 | 9.0% |
| walsingham | 7 | 7.0% |
| dognin | 5 | 5.0% |
| hopkins | 3 | 3.0% |
| quaintance | 2 | 2.0% |
| wlsm | 2 | 2.0% |
| townes | 2 | 2.0% |
| number | 2 | 2.0% |
| from | 2 | 2.0% |
| Other values (56) | 56 |
Most occurring characters
| Value | Count | Frequency (%) |
| 63 | 9.9% | |
| o | 52 | 8.2% |
| n | 47 | 7.4% |
| l | 39 | 6.1% |
| i | 33 | 5.2% |
| . | 26 | 4.1% |
| e | 25 | 3.9% |
| a | 22 | 3.5% |
| t | 19 | 3.0% |
| 1 | 19 | 3.0% |
| Other values (47) | 291 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 348 | |
| Decimal Number | 114 | 17.9% |
| Uppercase Letter | 67 | 10.5% |
| Space Separator | 63 | 9.9% |
| Other Punctuation | 40 | 6.3% |
| Dash Punctuation | 2 | 0.3% |
| Open Punctuation | 1 | 0.2% |
| Close Punctuation | 1 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 52 | |
| n | 47 | |
| l | 39 | |
| i | 33 | |
| e | 25 | 7.2% |
| a | 22 | 6.3% |
| t | 19 | 5.5% |
| c | 18 | 5.2% |
| s | 16 | 4.6% |
| g | 14 | 4.0% |
| Other values (11) | 63 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 14 | |
| W | 9 | |
| N | 9 | |
| H | 6 | |
| D | 5 | 7.5% |
| S | 4 | 6.0% |
| M | 3 | 4.5% |
| Q | 2 | 3.0% |
| T | 2 | 3.0% |
| U | 2 | 3.0% |
| Other values (9) | 11 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 19 | |
| 7 | 15 | |
| 0 | 14 | |
| 8 | 14 | |
| 5 | 12 | |
| 9 | 12 | |
| 4 | 8 | |
| 2 | 8 | |
| 6 | 7 | 6.1% |
| 3 | 5 | 4.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 26 | |
| " | 12 | |
| , | 2 | 5.0% |
Space Separator
| Value | Count | Frequency (%) |
| 63 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 415 | |
| Common | 221 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 52 | 12.5% |
| n | 47 | 11.3% |
| l | 39 | 9.4% |
| i | 33 | 8.0% |
| e | 25 | 6.0% |
| a | 22 | 5.3% |
| t | 19 | 4.6% |
| c | 18 | 4.3% |
| s | 16 | 3.9% |
| C | 14 | 3.4% |
| Other values (30) | 130 |
Common
| Value | Count | Frequency (%) |
| 63 | ||
| . | 26 | |
| 1 | 19 | 8.6% |
| 7 | 15 | 6.8% |
| 0 | 14 | 6.3% |
| 8 | 14 | 6.3% |
| 5 | 12 | 5.4% |
| " | 12 | 5.4% |
| 9 | 12 | 5.4% |
| 4 | 8 | 3.6% |
| Other values (7) | 26 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 636 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 63 | 9.9% | |
| o | 52 | 8.2% |
| n | 47 | 7.4% |
| l | 39 | 6.1% |
| i | 33 | 5.2% |
| . | 26 | 4.1% |
| e | 25 | 3.9% |
| a | 22 | 3.5% |
| t | 19 | 3.0% |
| 1 | 19 | 3.0% |
| Other values (47) | 291 |
recordedBy
Text
Missing 
| Distinct | 18726 |
|---|---|
| Distinct (%) | 4.7% |
| Missing | 203336 |
| Missing (%) | 33.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 90 |
|---|---|
| Median length | 84 |
| Mean length | 11.25684667 |
| Min length | 1 |
Unique
| Unique | 9104 ? |
|---|---|
| Unique (%) | 2.3% |
Sample
| 1st row | M. Ortiz B. |
|---|---|
| 2nd row | [Not Stated] |
| 3rd row | S. Roble |
| 4th row | [Not Stated] |
| 5th row | C. Flint |
| Value | Count | Frequency (%) |
| not | 65711 | 7.2% |
| stated | 65695 | 7.2% |
| l | 40182 | 4.4% |
| 39875 | 4.4% | |
| j | 36886 | 4.0% |
| macior | 31232 | 3.4% |
| d | 28468 | 3.1% |
| c | 27156 | 3.0% |
| r | 25636 | 2.8% |
| b | 22044 | 2.4% |
| Other values (10691) | 530776 |
Most occurring characters
| Value | Count | Frequency (%) |
| 512371 | 11.3% | |
| . | 355530 | 7.9% |
| t | 305132 | 6.8% |
| a | 299337 | 6.6% |
| e | 290066 | 6.4% |
| o | 240179 | 5.3% |
| r | 229270 | 5.1% |
| i | 173763 | 3.8% |
| n | 169850 | 3.8% |
| l | 136863 | 3.0% |
| Other values (73) | 1804899 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2587572 | |
| Uppercase Letter | 878220 | 19.4% |
| Space Separator | 512371 | 11.3% |
| Other Punctuation | 405394 | 9.0% |
| Open Punctuation | 65746 | 1.5% |
| Close Punctuation | 65746 | 1.5% |
| Dash Punctuation | 2190 | < 0.1% |
| Decimal Number | 21 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 305132 | |
| a | 299337 | |
| e | 290066 | |
| o | 240179 | |
| r | 229270 | |
| i | 173763 | 6.7% |
| n | 169850 | 6.6% |
| l | 136863 | 5.3% |
| d | 115259 | 4.5% |
| s | 95753 | 3.7% |
| Other values (25) | 532100 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 116393 | |
| M | 90618 | 10.3% |
| N | 79756 | 9.1% |
| B | 56903 | 6.5% |
| C | 54336 | 6.2% |
| L | 51912 | 5.9% |
| D | 47327 | 5.4% |
| J | 42554 | 4.8% |
| W | 40148 | 4.6% |
| G | 38218 | 4.4% |
| Other values (17) | 260055 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 8 | |
| 5 | 5 | |
| 2 | 2 | 9.5% |
| 6 | 2 | 9.5% |
| 0 | 2 | 9.5% |
| 9 | 1 | 4.8% |
| 3 | 1 | 4.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 355530 | |
| & | 39866 | 9.8% |
| , | 9359 | 2.3% |
| ' | 622 | 0.2% |
| ? | 16 | < 0.1% |
| / | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 65735 | |
| ( | 10 | < 0.1% |
| { | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 65735 | |
| ) | 10 | < 0.1% |
| } | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 512371 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2190 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3465792 | |
| Common | 1051468 | 23.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 305132 | 8.8% |
| a | 299337 | 8.6% |
| e | 290066 | 8.4% |
| o | 240179 | 6.9% |
| r | 229270 | 6.6% |
| i | 173763 | 5.0% |
| n | 169850 | 4.9% |
| l | 136863 | 3.9% |
| S | 116393 | 3.4% |
| d | 115259 | 3.3% |
| Other values (52) | 1389680 |
Common
| Value | Count | Frequency (%) |
| 512371 | ||
| . | 355530 | |
| [ | 65735 | 6.3% |
| ] | 65735 | 6.3% |
| & | 39866 | 3.8% |
| , | 9359 | 0.9% |
| - | 2190 | 0.2% |
| ' | 622 | 0.1% |
| ? | 16 | < 0.1% |
| ( | 10 | < 0.1% |
| Other values (11) | 34 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4516770 | |
| None | 490 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 512371 | 11.3% | |
| . | 355530 | 7.9% |
| t | 305132 | 6.8% |
| a | 299337 | 6.6% |
| e | 290066 | 6.4% |
| o | 240179 | 5.3% |
| r | 229270 | 5.1% |
| i | 173763 | 3.8% |
| n | 169850 | 3.8% |
| l | 136863 | 3.0% |
| Other values (63) | 1804409 |
None
| Value | Count | Frequency (%) |
| ñ | 238 | |
| ü | 107 | |
| á | 95 | 19.4% |
| ä | 13 | 2.7% |
| ö | 12 | 2.4% |
| é | 12 | 2.4% |
| ó | 8 | 1.6% |
| Á | 2 | 0.4% |
| č | 2 | 0.4% |
| â | 1 | 0.2% |
individualCount
Text
| Distinct | 941 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 3136 |
| Missing (%) | 0.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 1 |
| Mean length | 1.044865251 |
| Min length | 1 |
Unique
| Unique | 393 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 7 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 548219 | |
| 2 | 10272 | 1.7% |
| 3 | 6617 | 1.1% |
| 4 | 4294 | 0.7% |
| 5 | 2621 | 0.4% |
| 6 | 2340 | 0.4% |
| 7 | 1822 | 0.3% |
| 8 | 1526 | 0.3% |
| 10 | 1306 | 0.2% |
| 9 | 1254 | 0.2% |
| Other values (931) | 21219 | 3.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 560802 | |
| 2 | 17644 | 2.8% |
| 3 | 11797 | 1.9% |
| 4 | 8334 | 1.3% |
| 5 | 6510 | 1.0% |
| 0 | 6142 | 1.0% |
| 6 | 5349 | 0.9% |
| 7 | 4419 | 0.7% |
| 8 | 3991 | 0.6% |
| 9 | 3488 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 628476 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 560802 | |
| 2 | 17644 | 2.8% |
| 3 | 11797 | 1.9% |
| 4 | 8334 | 1.3% |
| 5 | 6510 | 1.0% |
| 0 | 6142 | 1.0% |
| 6 | 5349 | 0.9% |
| 7 | 4419 | 0.7% |
| 8 | 3991 | 0.6% |
| 9 | 3488 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 628476 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 560802 | |
| 2 | 17644 | 2.8% |
| 3 | 11797 | 1.9% |
| 4 | 8334 | 1.3% |
| 5 | 6510 | 1.0% |
| 0 | 6142 | 1.0% |
| 6 | 5349 | 0.9% |
| 7 | 4419 | 0.7% |
| 8 | 3991 | 0.6% |
| 9 | 3488 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 628476 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 560802 | |
| 2 | 17644 | 2.8% |
| 3 | 11797 | 1.9% |
| 4 | 8334 | 1.3% |
| 5 | 6510 | 1.0% |
| 0 | 6142 | 1.0% |
| 6 | 5349 | 0.9% |
| 7 | 4419 | 0.7% |
| 8 | 3991 | 0.6% |
| 9 | 3488 | 0.6% |
sex
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 384462 |
| Missing (%) | 63.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.79924965 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MALE |
|---|---|
| 2nd row | MALE |
| 3rd row | MALE |
| 4th row | MALE |
| 5th row | FEMALE |
| Value | Count | Frequency (%) |
| male | 132181 | |
| female | 87983 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 308147 | |
| M | 220164 | |
| A | 220164 | |
| L | 220164 | |
| F | 87983 | 8.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1056622 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 308147 | |
| M | 220164 | |
| A | 220164 | |
| L | 220164 | |
| F | 87983 | 8.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1056622 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 308147 | |
| M | 220164 | |
| A | 220164 | |
| L | 220164 | |
| F | 87983 | 8.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1056622 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 308147 | |
| M | 220164 | |
| A | 220164 | |
| L | 220164 | |
| F | 87983 | 8.3% |
lifeStage
Text
Missing 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 184129 |
| Missing (%) | 30.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 5 |
| Mean length | 5.02011905 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Adult |
|---|---|
| 2nd row | Adult |
| 3rd row | Adult |
| 4th row | Adult |
| 5th row | Adult |
| Value | Count | Frequency (%) |
| adult | 415182 | |
| immature | 2800 | 0.7% |
| pupa | 946 | 0.2% |
| larva | 886 | 0.2% |
| unknown | 490 | 0.1% |
| nymph | 139 | < 0.1% |
| egg | 34 | < 0.1% |
| deutonymph | 17 | < 0.1% |
| juvenile | 2 | < 0.1% |
| subadult | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| u | 418949 | |
| t | 418000 | |
| l | 415185 | |
| d | 415183 | |
| A | 415182 | |
| m | 5756 | 0.3% |
| a | 5519 | 0.3% |
| r | 3686 | 0.2% |
| e | 2821 | 0.1% |
| I | 2800 | 0.1% |
| Other values (19) | 7864 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1690448 | |
| Uppercase Letter | 420497 | 19.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 418949 | |
| t | 418000 | |
| l | 415185 | |
| d | 415183 | |
| m | 5756 | 0.3% |
| a | 5519 | 0.3% |
| r | 3686 | 0.2% |
| e | 2821 | 0.2% |
| n | 1489 | 0.1% |
| p | 1102 | 0.1% |
| Other values (9) | 2758 | 0.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 415182 | |
| I | 2800 | 0.7% |
| P | 946 | 0.2% |
| L | 886 | 0.2% |
| U | 490 | 0.1% |
| N | 139 | < 0.1% |
| E | 34 | < 0.1% |
| D | 17 | < 0.1% |
| J | 2 | < 0.1% |
| S | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2110945 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| u | 418949 | |
| t | 418000 | |
| l | 415185 | |
| d | 415183 | |
| A | 415182 | |
| m | 5756 | 0.3% |
| a | 5519 | 0.3% |
| r | 3686 | 0.2% |
| e | 2821 | 0.1% |
| I | 2800 | 0.1% |
| Other values (19) | 7864 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2110945 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| u | 418949 | |
| t | 418000 | |
| l | 415185 | |
| d | 415183 | |
| A | 415182 | |
| m | 5756 | 0.3% |
| a | 5519 | 0.3% |
| r | 3686 | 0.2% |
| e | 2821 | 0.1% |
| I | 2800 | 0.1% |
| Other values (19) | 7864 | 0.4% |
occurrenceStatus
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESENT |
|---|---|
| 2nd row | PRESENT |
| 3rd row | PRESENT |
| 4th row | PRESENT |
| 5th row | PRESENT |
| Value | Count | Frequency (%) |
| present | 604626 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 1209252 | |
| P | 604626 | |
| R | 604626 | |
| S | 604626 | |
| N | 604626 | |
| T | 604626 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4232382 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1209252 | |
| P | 604626 | |
| R | 604626 | |
| S | 604626 | |
| N | 604626 | |
| T | 604626 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4232382 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 1209252 | |
| P | 604626 | |
| R | 604626 | |
| S | 604626 | |
| N | 604626 | |
| T | 604626 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4232382 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 1209252 | |
| P | 604626 | |
| R | 604626 | |
| S | 604626 | |
| N | 604626 | |
| T | 604626 |
preparations
Text
Missing 
| Distinct | 272 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 42051 |
| Missing (%) | 7.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 93 |
|---|---|
| Median length | 6 |
| Mean length | 6.839850687 |
| Min length | 1 |
Unique
| Unique | 112 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Pinned |
|---|---|
| 2nd row | Pinned |
| 3rd row | Pinned |
| 4th row | Envelope |
| 5th row | Pinned |
| Value | Count | Frequency (%) |
| pinned | 389733 | |
| envelope | 114672 | 18.8% |
| slide | 65056 | 10.7% |
| vial | 9495 | 1.6% |
| ethanol | 6481 | 1.1% |
| section | 3746 | 0.6% |
| on | 3653 | 0.6% |
| 3195 | 0.5% | |
| ink | 3151 | 0.5% |
| pen | 3072 | 0.5% |
| Other values (93) | 7800 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 916431 | |
| e | 701114 | |
| i | 472644 | |
| d | 455886 | |
| P | 366191 | 9.5% |
| l | 199752 | 5.2% |
| p | 142785 | 3.7% |
| o | 133876 | 3.5% |
| v | 114834 | 3.0% |
| E | 112885 | 2.9% |
| Other values (48) | 231531 | 6.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3214047 | |
| Uppercase Letter | 553344 | 14.4% |
| Space Separator | 47479 | 1.2% |
| Other Punctuation | 32278 | 0.8% |
| Decimal Number | 781 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 916431 | |
| e | 701114 | |
| i | 472644 | |
| d | 455886 | |
| l | 199752 | 6.2% |
| p | 142785 | 4.4% |
| o | 133876 | 4.2% |
| v | 114834 | 3.6% |
| a | 18594 | 0.6% |
| s | 17524 | 0.5% |
| Other values (15) | 40607 | 1.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 366191 | |
| E | 112885 | 20.4% |
| S | 56001 | 10.1% |
| V | 9715 | 1.8% |
| I | 3164 | 0.6% |
| B | 2575 | 0.5% |
| R | 887 | 0.2% |
| M | 523 | 0.1% |
| C | 505 | 0.1% |
| D | 388 | 0.1% |
| Other values (10) | 510 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 28578 | |
| & | 3195 | 9.9% |
| % | 389 | 1.2% |
| . | 69 | 0.2% |
| , | 28 | 0.1% |
| / | 15 | < 0.1% |
| ? | 4 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 389 | |
| 7 | 389 | |
| 2 | 1 | 0.1% |
| 3 | 1 | 0.1% |
| 9 | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 47479 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3767391 | |
| Common | 80538 | 2.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 916431 | |
| e | 701114 | |
| i | 472644 | |
| d | 455886 | |
| P | 366191 | 9.7% |
| l | 199752 | 5.3% |
| p | 142785 | 3.8% |
| o | 133876 | 3.6% |
| v | 114834 | 3.0% |
| E | 112885 | 3.0% |
| Other values (35) | 150993 | 4.0% |
Common
| Value | Count | Frequency (%) |
| 47479 | ||
| ; | 28578 | |
| & | 3195 | 4.0% |
| 5 | 389 | 0.5% |
| % | 389 | 0.5% |
| 7 | 389 | 0.5% |
| . | 69 | 0.1% |
| , | 28 | < 0.1% |
| / | 15 | < 0.1% |
| ? | 4 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3847929 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 916431 | |
| e | 701114 | |
| i | 472644 | |
| d | 455886 | |
| P | 366191 | 9.5% |
| l | 199752 | 5.2% |
| p | 142785 | 3.7% |
| o | 133876 | 3.5% |
| v | 114834 | 3.0% |
| E | 112885 | 2.9% |
| Other values (48) | 231531 | 6.0% |
Missing 
| Distinct | 31232 |
|---|---|
| Distinct (%) | 21.5% |
| Missing | 459276 |
| Missing (%) | 76.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 367359 |
|---|---|
| Median length | 152440 |
| Mean length | 80.11453732 |
| Min length | 1 |
Unique
| Unique | 27501 ? |
|---|---|
| Unique (%) | 18.9% |
Sample
| 1st row | One leg removed for genetic sampling while on loan to GUELPH. |
|---|---|
| 2nd row | Lindroth, 1975:125: (the loc. is no doubt wrong). |
| 3rd row | F. Monros Coll. 1959 G.M. Greene Coll. C. Schaeffer Coll. Shoemaker Coll. 1956 A. Nicolay Coll. 1950 L.W. Saylor Coll. |
| 4th row | Specimen data is incomplete. Phase 1 of data capture inlcluded USNMENT#s and general locality. |
| 5th row | One leg removed for genetic sampling while on loan to GUELPH. |
| Value | Count | Frequency (%) |
| digitization | 56218 | 3.3% |
| by | 48162 | 2.8% |
| digital | 44075 | 2.6% |
| volunteers | 44039 | 2.6% |
| transcribed | 44039 | 2.6% |
| of | 43241 | 2.6% |
| on | 41034 | 2.4% |
| to | 36796 | 2.2% |
| loan | 36495 | 2.2% |
| for | 36258 | 2.1% |
| Other values (49844) | 1263433 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1504787 | 12.9% | |
| e | 838841 | 7.2% |
| i | 811548 | 7.0% |
| a | 687048 | 5.9% |
| t | 675294 | 5.8% |
| o | 659287 | 5.7% |
| n | 620298 | 5.3% |
| r | 558541 | 4.8% |
| s | 454981 | 3.9% |
| l | 435458 | 3.7% |
| Other values (116) | 4398565 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8072373 | |
| Space Separator | 1504787 | 12.9% |
| Uppercase Letter | 1147070 | 9.9% |
| Decimal Number | 347393 | 3.0% |
| Other Punctuation | 306192 | 2.6% |
| Control | 139457 | 1.2% |
| Open Punctuation | 40059 | 0.3% |
| Close Punctuation | 40034 | 0.3% |
| Dash Punctuation | 25457 | 0.2% |
| Math Symbol | 11915 | 0.1% |
| Other values (7) | 9911 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 838841 | |
| i | 811548 | |
| a | 687048 | 8.5% |
| t | 675294 | 8.4% |
| o | 659287 | 8.2% |
| n | 620298 | 7.7% |
| r | 558541 | 6.9% |
| s | 454981 | 5.6% |
| l | 435458 | 5.4% |
| d | 321203 | 4.0% |
| Other values (30) | 2009874 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 119799 | 10.4% |
| S | 109393 | 9.5% |
| O | 106766 | 9.3% |
| E | 97854 | 8.5% |
| D | 76330 | 6.7% |
| I | 72682 | 6.3% |
| T | 72597 | 6.3% |
| M | 68130 | 5.9% |
| U | 60048 | 5.2% |
| L | 53055 | 4.6% |
| Other values (21) | 310416 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 176603 | |
| ; | 48088 | 15.7% |
| , | 34078 | 11.1% |
| : | 20222 | 6.6% |
| # | 9313 | 3.0% |
| / | 6742 | 2.2% |
| ' | 5289 | 1.7% |
| " | 3442 | 1.1% |
| & | 1718 | 0.6% |
| ? | 602 | 0.2% |
| Other values (7) | 95 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 70694 | |
| 9 | 43824 | |
| 2 | 42022 | |
| 0 | 39802 | |
| 3 | 27396 | 7.9% |
| 4 | 27182 | 7.8% |
| 5 | 26126 | 7.5% |
| 6 | 25057 | 7.2% |
| 8 | 23473 | 6.8% |
| 7 | 21817 | 6.3% |
Math Symbol
| Value | Count | Frequency (%) |
| | | 10552 | |
| + | 720 | 6.0% |
| = | 620 | 5.2% |
| > | 11 | 0.1% |
| ~ | 8 | 0.1% |
| < | 4 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ♂ | 14 | |
| ° | 7 | |
| ♀ | 4 | 15.4% |
| © | 1 | 3.8% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 33853 | |
| [ | 6195 | 15.5% |
| { | 11 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 33841 | |
| ] | 6182 | 15.4% |
| } | 11 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| 138832 | ||
| 625 | 0.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 25456 | |
| — | 1 | < 0.1% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 1 | |
| £ | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1504787 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 9827 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 23 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 23 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ^ | 9 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʼ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9219441 | |
| Common | 2425207 | 20.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 838841 | 9.1% |
| i | 811548 | 8.8% |
| a | 687048 | 7.5% |
| t | 675294 | 7.3% |
| o | 659287 | 7.2% |
| n | 620298 | 6.7% |
| r | 558541 | 6.1% |
| s | 454981 | 4.9% |
| l | 435458 | 4.7% |
| d | 321203 | 3.5% |
| Other values (60) | 3156942 |
Common
| Value | Count | Frequency (%) |
| 1504787 | ||
| . | 176603 | 7.3% |
| 138832 | 5.7% | |
| 1 | 70694 | 2.9% |
| ; | 48088 | 2.0% |
| 9 | 43824 | 1.8% |
| 2 | 42022 | 1.7% |
| 0 | 39802 | 1.6% |
| , | 34078 | 1.4% |
| ( | 33853 | 1.4% |
| Other values (46) | 292624 | 12.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11644446 | |
| None | 134 | < 0.1% |
| Punctuation | 49 | < 0.1% |
| Misc Symbols | 18 | < 0.1% |
| Modifier Letters | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1504787 | 12.9% | |
| e | 838841 | 7.2% |
| i | 811548 | 7.0% |
| a | 687048 | 5.9% |
| t | 675294 | 5.8% |
| o | 659287 | 5.7% |
| n | 620298 | 5.3% |
| r | 558541 | 4.8% |
| s | 454981 | 3.9% |
| l | 435458 | 3.7% |
| Other values (85) | 4398363 |
None
| Value | Count | Frequency (%) |
| é | 32 | |
| á | 22 | |
| ü | 20 | |
| í | 9 | 6.7% |
| ó | 8 | 6.0% |
| · | 7 | 5.2% |
| ° | 7 | 5.2% |
| ö | 5 | 3.7% |
| ø | 3 | 2.2% |
| É | 2 | 1.5% |
| Other values (14) | 19 |
Punctuation
| Value | Count | Frequency (%) |
| “ | 23 | |
| ” | 23 | |
| … | 2 | 4.1% |
| — | 1 | 2.0% |
Misc Symbols
| Value | Count | Frequency (%) |
| ♂ | 14 | |
| ♀ | 4 | 22.2% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʼ | 1 |
verbatimLabel
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604625 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | -11.7815 |
|---|
| Value | Count | Frequency (%) |
| 11.7815 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 3 | |
| - | 1 | 12.5% |
| . | 1 | 12.5% |
| 7 | 1 | 12.5% |
| 8 | 1 | 12.5% |
| 5 | 1 | 12.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 | |
| Dash Punctuation | 1 | 12.5% |
| Other Punctuation | 1 | 12.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3 | |
| 7 | 1 | 16.7% |
| 8 | 1 | 16.7% |
| 5 | 1 | 16.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 3 | |
| - | 1 | 12.5% |
| . | 1 | 12.5% |
| 7 | 1 | 12.5% |
| 8 | 1 | 12.5% |
| 5 | 1 | 12.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 3 | |
| - | 1 | 12.5% |
| . | 1 | 12.5% |
| 7 | 1 | 12.5% |
| 8 | 1 | 12.5% |
| 5 | 1 | 12.5% |
materialSampleID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604625 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | -76.7017 |
|---|
| Value | Count | Frequency (%) |
| 76.7017 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 3 | |
| - | 1 | 12.5% |
| 6 | 1 | 12.5% |
| . | 1 | 12.5% |
| 0 | 1 | 12.5% |
| 1 | 1 | 12.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 | |
| Dash Punctuation | 1 | 12.5% |
| Other Punctuation | 1 | 12.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 3 | |
| 6 | 1 | 16.7% |
| 0 | 1 | 16.7% |
| 1 | 1 | 16.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 7 | 3 | |
| - | 1 | 12.5% |
| 6 | 1 | 12.5% |
| . | 1 | 12.5% |
| 0 | 1 | 12.5% |
| 1 | 1 | 12.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | 3 | |
| - | 1 | 12.5% |
| 6 | 1 | 12.5% |
| . | 1 | 12.5% |
| 0 | 1 | 12.5% |
| 1 | 1 | 12.5% |
fieldNumber
Text
Missing 
| Distinct | 3091 |
|---|---|
| Distinct (%) | 72.7% |
| Missing | 600377 |
| Missing (%) | 99.3% |
| Memory size | 4.6 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 29 |
| Mean length | 9.591433278 |
| Min length | 1 |
Unique
| Unique | 2646 ? |
|---|---|
| Unique (%) | 62.3% |
Sample
| 1st row | BBB991 |
|---|---|
| 2nd row | BBB642-DERM |
| 3rd row | 1653 |
| 4th row | JSL021109-18 |
| 5th row | COL-8-101 |
| Value | Count | Frequency (%) |
| 1653 | 128 | 2.8% |
| 2 | 46 | 1.0% |
| bbb899-hym | 34 | 0.7% |
| 1 | 32 | 0.7% |
| bbb791-hym | 25 | 0.5% |
| bbb749-hym | 23 | 0.5% |
| 759-8 | 22 | 0.5% |
| tub | 20 | 0.4% |
| tank | 18 | 0.4% |
| 9 | 18 | 0.4% |
| Other values (3087) | 4225 |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 4781 | 11.7% |
| 0 | 3995 | 9.8% |
| - | 3976 | 9.8% |
| 1 | 3398 | 8.3% |
| 2 | 2238 | 5.5% |
| 3 | 1558 | 3.8% |
| 6 | 1541 | 3.8% |
| 7 | 1509 | 3.7% |
| 4 | 1498 | 3.7% |
| 9 | 1481 | 3.6% |
| Other values (60) | 14779 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 19488 | |
| Uppercase Letter | 15048 | |
| Dash Punctuation | 3976 | 9.8% |
| Lowercase Letter | 1242 | 3.0% |
| Other Punctuation | 654 | 1.6% |
| Space Separator | 342 | 0.8% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 4781 | |
| S | 1388 | 9.2% |
| T | 1136 | 7.5% |
| C | 792 | 5.3% |
| M | 763 | 5.1% |
| A | 707 | 4.7% |
| L | 667 | 4.4% |
| R | 639 | 4.2% |
| N | 583 | 3.9% |
| H | 532 | 3.5% |
| Other values (15) | 3060 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 146 | |
| a | 138 | |
| o | 134 | |
| t | 118 | 9.5% |
| b | 82 | 6.6% |
| n | 81 | 6.5% |
| r | 67 | 5.4% |
| m | 57 | 4.6% |
| c | 57 | 4.6% |
| i | 55 | 4.4% |
| Other values (13) | 307 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3995 | |
| 1 | 3398 | |
| 2 | 2238 | |
| 3 | 1558 | 8.0% |
| 6 | 1541 | 7.9% |
| 7 | 1509 | 7.7% |
| 4 | 1498 | 7.7% |
| 9 | 1481 | 7.6% |
| 5 | 1174 | 6.0% |
| 8 | 1096 | 5.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| # | 344 | |
| . | 199 | |
| ; | 93 | 14.2% |
| , | 10 | 1.5% |
| ' | 3 | 0.5% |
| " | 3 | 0.5% |
| / | 1 | 0.2% |
| : | 1 | 0.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3976 |
Space Separator
| Value | Count | Frequency (%) |
| 342 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 24464 | |
| Latin | 16290 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 4781 | |
| S | 1388 | 8.5% |
| T | 1136 | 7.0% |
| C | 792 | 4.9% |
| M | 763 | 4.7% |
| A | 707 | 4.3% |
| L | 667 | 4.1% |
| R | 639 | 3.9% |
| N | 583 | 3.6% |
| H | 532 | 3.3% |
| Other values (38) | 4302 |
Common
| Value | Count | Frequency (%) |
| 0 | 3995 | |
| - | 3976 | |
| 1 | 3398 | |
| 2 | 2238 | |
| 3 | 1558 | 6.4% |
| 6 | 1541 | 6.3% |
| 7 | 1509 | 6.2% |
| 4 | 1498 | 6.1% |
| 9 | 1481 | 6.1% |
| 5 | 1174 | 4.8% |
| Other values (12) | 2096 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40754 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| B | 4781 | 11.7% |
| 0 | 3995 | 9.8% |
| - | 3976 | 9.8% |
| 1 | 3398 | 8.3% |
| 2 | 2238 | 5.5% |
| 3 | 1558 | 3.8% |
| 6 | 1541 | 3.8% |
| 7 | 1509 | 3.7% |
| 4 | 1498 | 3.7% |
| 9 | 1481 | 3.6% |
| Other values (60) | 14779 |
eventDate
Text
Missing 
| Distinct | 45561 |
|---|---|
| Distinct (%) | 12.5% |
| Missing | 239769 |
| Missing (%) | 39.7% |
| Memory size | 4.6 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 10 |
| Mean length | 10.99102388 |
| Min length | 4 |
Unique
| Unique | 12880 ? |
|---|---|
| Unique (%) | 3.5% |
Sample
| 1st row | 1967-06-20 |
|---|---|
| 2nd row | 1914-07 |
| 3rd row | 2005-08-02 |
| 4th row | 1964-04-25 |
| 5th row | 1971-08-22 |
| Value | Count | Frequency (%) |
| 1998-07-26 | 709 | 0.2% |
| 1938 | 599 | 0.2% |
| 1896 | 545 | 0.1% |
| 2006-06-24 | 544 | 0.1% |
| 1933 | 543 | 0.1% |
| 1960-06-30 | 506 | 0.1% |
| 1930 | 495 | 0.1% |
| 1936 | 490 | 0.1% |
| 1927-07-10 | 469 | 0.1% |
| 1964-08-01/1964-08-31 | 449 | 0.1% |
| Other values (45551) | 359508 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 776958 | |
| 1 | 696249 | |
| 0 | 648003 | |
| 9 | 488959 | |
| 2 | 286183 | 7.1% |
| 6 | 224282 | 5.6% |
| 7 | 215676 | 5.4% |
| 8 | 182043 | 4.5% |
| 5 | 158527 | 4.0% |
| 3 | 154598 | 3.9% |
| Other values (2) | 178674 | 4.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3189224 | |
| Dash Punctuation | 776958 | 19.4% |
| Other Punctuation | 43970 | 1.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 696249 | |
| 0 | 648003 | |
| 9 | 488959 | |
| 2 | 286183 | |
| 6 | 224282 | 7.0% |
| 7 | 215676 | 6.8% |
| 8 | 182043 | 5.7% |
| 5 | 158527 | 5.0% |
| 3 | 154598 | 4.8% |
| 4 | 134704 | 4.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 776958 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 43970 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4010152 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 776958 | |
| 1 | 696249 | |
| 0 | 648003 | |
| 9 | 488959 | |
| 2 | 286183 | 7.1% |
| 6 | 224282 | 5.6% |
| 7 | 215676 | 5.4% |
| 8 | 182043 | 4.5% |
| 5 | 158527 | 4.0% |
| 3 | 154598 | 3.9% |
| Other values (2) | 178674 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4010152 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 776958 | |
| 1 | 696249 | |
| 0 | 648003 | |
| 9 | 488959 | |
| 2 | 286183 | 7.1% |
| 6 | 224282 | 5.6% |
| 7 | 215676 | 5.4% |
| 8 | 182043 | 4.5% |
| 5 | 158527 | 4.0% |
| 3 | 154598 | 3.9% |
| Other values (2) | 178674 | 4.5% |
startDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 270965 |
| Missing (%) | 44.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.85075271 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 171 |
|---|---|
| 2nd row | 214 |
| 3rd row | 116 |
| 4th row | 234 |
| 5th row | 157 |
| Value | Count | Frequency (%) |
| 182 | 3298 | 1.0% |
| 183 | 2901 | 0.9% |
| 191 | 2876 | 0.9% |
| 207 | 2734 | 0.8% |
| 213 | 2713 | 0.8% |
| 178 | 2623 | 0.8% |
| 214 | 2602 | 0.8% |
| 172 | 2574 | 0.8% |
| 189 | 2556 | 0.8% |
| 218 | 2541 | 0.8% |
| Other values (356) | 306243 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 221414 | |
| 2 | 186945 | |
| 3 | 89297 | |
| 9 | 66624 | 7.0% |
| 8 | 65748 | 6.9% |
| 0 | 65602 | 6.9% |
| 6 | 64712 | 6.8% |
| 7 | 63778 | 6.7% |
| 5 | 63690 | 6.7% |
| 4 | 63375 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 951185 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 221414 | |
| 2 | 186945 | |
| 3 | 89297 | |
| 9 | 66624 | 7.0% |
| 8 | 65748 | 6.9% |
| 0 | 65602 | 6.9% |
| 6 | 64712 | 6.8% |
| 7 | 63778 | 6.7% |
| 5 | 63690 | 6.7% |
| 4 | 63375 | 6.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 951185 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 221414 | |
| 2 | 186945 | |
| 3 | 89297 | |
| 9 | 66624 | 7.0% |
| 8 | 65748 | 6.9% |
| 0 | 65602 | 6.9% |
| 6 | 64712 | 6.8% |
| 7 | 63778 | 6.7% |
| 5 | 63690 | 6.7% |
| 4 | 63375 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 951185 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 221414 | |
| 2 | 186945 | |
| 3 | 89297 | |
| 9 | 66624 | 7.0% |
| 8 | 65748 | 6.9% |
| 0 | 65602 | 6.9% |
| 6 | 64712 | 6.8% |
| 7 | 63778 | 6.7% |
| 5 | 63690 | 6.7% |
| 4 | 63375 | 6.7% |
endDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 270965 |
| Missing (%) | 44.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.860076545 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 171 |
|---|---|
| 2nd row | 214 |
| 3rd row | 116 |
| 4th row | 234 |
| 5th row | 157 |
| Value | Count | Frequency (%) |
| 207 | 2989 | 0.9% |
| 191 | 2948 | 0.9% |
| 197 | 2758 | 0.8% |
| 212 | 2710 | 0.8% |
| 182 | 2684 | 0.8% |
| 178 | 2598 | 0.8% |
| 181 | 2581 | 0.8% |
| 196 | 2566 | 0.8% |
| 172 | 2491 | 0.7% |
| 208 | 2488 | 0.7% |
| Other values (356) | 306848 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 220893 | |
| 2 | 187359 | |
| 3 | 90241 | |
| 9 | 67299 | 7.1% |
| 0 | 66539 | 7.0% |
| 7 | 65376 | 6.9% |
| 8 | 65012 | 6.8% |
| 6 | 64562 | 6.8% |
| 5 | 63697 | 6.7% |
| 4 | 63318 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 954296 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 220893 | |
| 2 | 187359 | |
| 3 | 90241 | |
| 9 | 67299 | 7.1% |
| 0 | 66539 | 7.0% |
| 7 | 65376 | 6.9% |
| 8 | 65012 | 6.8% |
| 6 | 64562 | 6.8% |
| 5 | 63697 | 6.7% |
| 4 | 63318 | 6.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 954296 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 220893 | |
| 2 | 187359 | |
| 3 | 90241 | |
| 9 | 67299 | 7.1% |
| 0 | 66539 | 7.0% |
| 7 | 65376 | 6.9% |
| 8 | 65012 | 6.8% |
| 6 | 64562 | 6.8% |
| 5 | 63697 | 6.7% |
| 4 | 63318 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 954296 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 220893 | |
| 2 | 187359 | |
| 3 | 90241 | |
| 9 | 67299 | 7.1% |
| 0 | 66539 | 7.0% |
| 7 | 65376 | 6.9% |
| 8 | 65012 | 6.8% |
| 6 | 64562 | 6.8% |
| 5 | 63697 | 6.7% |
| 4 | 63318 | 6.6% |
year
Text
Missing 
| Distinct | 190 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 240229 |
| Missing (%) | 39.7% |
| Memory size | 4.6 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 17 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1967 |
|---|---|
| 2nd row | 1914 |
| 3rd row | 2005 |
| 4th row | 1964 |
| 5th row | 1971 |
| Value | Count | Frequency (%) |
| 1966 | 12303 | 3.4% |
| 1968 | 9189 | 2.5% |
| 1971 | 8968 | 2.5% |
| 1967 | 8355 | 2.3% |
| 1965 | 7870 | 2.2% |
| 1972 | 6272 | 1.7% |
| 1964 | 6145 | 1.7% |
| 1974 | 6095 | 1.7% |
| 1973 | 6077 | 1.7% |
| 1963 | 5552 | 1.5% |
| Other values (180) | 287571 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 397082 | |
| 9 | 381004 | |
| 6 | 108661 | 7.5% |
| 0 | 107799 | 7.4% |
| 2 | 92602 | 6.4% |
| 7 | 89152 | 6.1% |
| 8 | 74474 | 5.1% |
| 5 | 72350 | 5.0% |
| 3 | 69496 | 4.8% |
| 4 | 64968 | 4.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1457588 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 397082 | |
| 9 | 381004 | |
| 6 | 108661 | 7.5% |
| 0 | 107799 | 7.4% |
| 2 | 92602 | 6.4% |
| 7 | 89152 | 6.1% |
| 8 | 74474 | 5.1% |
| 5 | 72350 | 5.0% |
| 3 | 69496 | 4.8% |
| 4 | 64968 | 4.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1457588 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 397082 | |
| 9 | 381004 | |
| 6 | 108661 | 7.5% |
| 0 | 107799 | 7.4% |
| 2 | 92602 | 6.4% |
| 7 | 89152 | 6.1% |
| 8 | 74474 | 5.1% |
| 5 | 72350 | 5.0% |
| 3 | 69496 | 4.8% |
| 4 | 64968 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1457588 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 397082 | |
| 9 | 381004 | |
| 6 | 108661 | 7.5% |
| 0 | 107799 | 7.4% |
| 2 | 92602 | 6.4% |
| 7 | 89152 | 6.1% |
| 8 | 74474 | 5.1% |
| 5 | 72350 | 5.0% |
| 3 | 69496 | 4.8% |
| 4 | 64968 | 4.5% |
month
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 254573 |
| Missing (%) | 42.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.112948611 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 6 |
|---|---|
| 2nd row | 7 |
| 3rd row | 8 |
| 4th row | 4 |
| 5th row | 8 |
| Value | Count | Frequency (%) |
| 7 | 73156 | |
| 6 | 58086 | |
| 8 | 51402 | |
| 5 | 35620 | |
| 9 | 25573 | 7.3% |
| 4 | 24539 | 7.0% |
| 3 | 16420 | 4.7% |
| 10 | 16139 | 4.6% |
| 2 | 13949 | 4.0% |
| 11 | 13286 | 3.8% |
| Other values (2) | 21883 | 6.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 73156 | |
| 1 | 64594 | |
| 6 | 58086 | |
| 8 | 51402 | |
| 5 | 35620 | |
| 9 | 25573 | 6.6% |
| 4 | 24539 | 6.3% |
| 2 | 24062 | 6.2% |
| 3 | 16420 | 4.2% |
| 0 | 16139 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 389591 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 73156 | |
| 1 | 64594 | |
| 6 | 58086 | |
| 8 | 51402 | |
| 5 | 35620 | |
| 9 | 25573 | 6.6% |
| 4 | 24539 | 6.3% |
| 2 | 24062 | 6.2% |
| 3 | 16420 | 4.2% |
| 0 | 16139 | 4.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 389591 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 7 | 73156 | |
| 1 | 64594 | |
| 6 | 58086 | |
| 8 | 51402 | |
| 5 | 35620 | |
| 9 | 25573 | 6.6% |
| 4 | 24539 | 6.3% |
| 2 | 24062 | 6.2% |
| 3 | 16420 | 4.2% |
| 0 | 16139 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 389591 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | 73156 | |
| 1 | 64594 | |
| 6 | 58086 | |
| 8 | 51402 | |
| 5 | 35620 | |
| 9 | 25573 | 6.6% |
| 4 | 24539 | 6.3% |
| 2 | 24062 | 6.2% |
| 3 | 16420 | 4.2% |
| 0 | 16139 | 4.1% |
day
Text
Missing 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 314935 |
| Missing (%) | 52.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.709435226 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 20 |
|---|---|
| 2nd row | 2 |
| 3rd row | 25 |
| 4th row | 22 |
| 5th row | 6 |
| Value | Count | Frequency (%) |
| 8 | 11096 | 3.8% |
| 20 | 10742 | 3.7% |
| 10 | 10614 | 3.7% |
| 1 | 10586 | 3.7% |
| 12 | 10579 | 3.7% |
| 15 | 10517 | 3.6% |
| 26 | 9863 | 3.4% |
| 25 | 9824 | 3.4% |
| 16 | 9809 | 3.4% |
| 14 | 9721 | 3.4% |
| Other values (21) | 186340 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 132241 | |
| 2 | 123511 | |
| 3 | 40350 | 8.1% |
| 0 | 29762 | 6.0% |
| 8 | 29425 | 5.9% |
| 6 | 29321 | 5.9% |
| 5 | 28696 | 5.8% |
| 4 | 28235 | 5.7% |
| 7 | 27405 | 5.5% |
| 9 | 26262 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 495208 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 132241 | |
| 2 | 123511 | |
| 3 | 40350 | 8.1% |
| 0 | 29762 | 6.0% |
| 8 | 29425 | 5.9% |
| 6 | 29321 | 5.9% |
| 5 | 28696 | 5.8% |
| 4 | 28235 | 5.7% |
| 7 | 27405 | 5.5% |
| 9 | 26262 | 5.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 495208 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 132241 | |
| 2 | 123511 | |
| 3 | 40350 | 8.1% |
| 0 | 29762 | 6.0% |
| 8 | 29425 | 5.9% |
| 6 | 29321 | 5.9% |
| 5 | 28696 | 5.8% |
| 4 | 28235 | 5.7% |
| 7 | 27405 | 5.5% |
| 9 | 26262 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 495208 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 132241 | |
| 2 | 123511 | |
| 3 | 40350 | 8.1% |
| 0 | 29762 | 6.0% |
| 8 | 29425 | 5.9% |
| 6 | 29321 | 5.9% |
| 5 | 28696 | 5.8% |
| 4 | 28235 | 5.7% |
| 7 | 27405 | 5.5% |
| 9 | 26262 | 5.3% |
Missing 
| Distinct | 67985 |
|---|---|
| Distinct (%) | 32.6% |
| Missing | 396306 |
| Missing (%) | 65.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 79 |
|---|---|
| Median length | 71 |
| Mean length | 10.59670219 |
| Min length | 1 |
Unique
| Unique | 51573 ? |
|---|---|
| Unique (%) | 24.8% |
Sample
| 1st row | [Not Stated] |
|---|---|
| 2nd row | 2-Aug-2005 |
| 3rd row | [Not Stated] |
| 4th row | [Not Stated] |
| 5th row | 9-IX-78 |
| Value | Count | Frequency (%) |
| not | 32197 | 8.2% |
| stated | 32165 | 8.2% |
| july | 8706 | 2.2% |
| aug | 7740 | 2.0% |
| june | 7233 | 1.8% |
| may | 5957 | 1.5% |
| 1968 | 5763 | 1.5% |
| 1971 | 5705 | 1.5% |
| 1966 | 4507 | 1.1% |
| 1972 | 2977 | 0.8% |
| Other values (37313) | 279737 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 217306 | 9.8% |
| 184367 | 8.4% | |
| 9 | 146678 | 6.6% |
| - | 127695 | 5.8% |
| 2 | 112927 | 5.1% |
| t | 105528 | 4.8% |
| I | 88868 | 4.0% |
| 6 | 79315 | 3.6% |
| 0 | 76302 | 3.5% |
| . | 64856 | 2.9% |
| Other values (82) | 1003663 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 900761 | |
| Lowercase Letter | 464809 | |
| Uppercase Letter | 333358 | 15.1% |
| Space Separator | 184367 | 8.4% |
| Other Punctuation | 128788 | 5.8% |
| Dash Punctuation | 127731 | 5.8% |
| Open Punctuation | 33629 | 1.5% |
| Close Punctuation | 33624 | 1.5% |
| Connector Punctuation | 250 | < 0.1% |
| Math Symbol | 187 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 105528 | |
| e | 57947 | |
| a | 49161 | |
| u | 41265 | 8.9% |
| o | 39662 | 8.5% |
| d | 33264 | 7.2% |
| n | 19822 | 4.3% |
| y | 17900 | 3.9% |
| l | 17062 | 3.7% |
| r | 16875 | 3.6% |
| Other values (18) | 66323 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 88868 | |
| V | 43505 | |
| N | 38307 | |
| S | 36913 | |
| J | 33535 | 10.1% |
| A | 23441 | 7.0% |
| M | 13900 | 4.2% |
| X | 9129 | 2.7% |
| U | 7428 | 2.2% |
| E | 5306 | 1.6% |
| Other values (17) | 33026 | 9.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 64856 | |
| , | 34975 | |
| / | 22999 | 17.9% |
| ' | 5024 | 3.9% |
| : | 620 | 0.5% |
| ? | 141 | 0.1% |
| ; | 102 | 0.1% |
| & | 38 | < 0.1% |
| " | 21 | < 0.1% |
| # | 6 | < 0.1% |
| Other values (3) | 6 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 217306 | |
| 9 | 146678 | |
| 2 | 112927 | |
| 6 | 79315 | 8.8% |
| 0 | 76302 | 8.5% |
| 7 | 63754 | 7.1% |
| 3 | 54206 | 6.0% |
| 8 | 53731 | 6.0% |
| 5 | 48632 | 5.4% |
| 4 | 47910 | 5.3% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 33541 | |
| ( | 82 | 0.2% |
| { | 6 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 33536 | |
| ) | 82 | 0.2% |
| } | 6 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| | | 156 | |
| + | 26 | 13.9% |
| = | 5 | 2.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 127695 | |
| – | 36 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 184367 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 250 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ^ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1409338 | |
| Latin | 798167 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 105528 | |
| I | 88868 | 11.1% |
| e | 57947 | 7.3% |
| a | 49161 | 6.2% |
| V | 43505 | 5.5% |
| u | 41265 | 5.2% |
| o | 39662 | 5.0% |
| N | 38307 | 4.8% |
| S | 36913 | 4.6% |
| J | 33535 | 4.2% |
| Other values (45) | 263476 |
Common
| Value | Count | Frequency (%) |
| 1 | 217306 | |
| 184367 | ||
| 9 | 146678 | |
| - | 127695 | |
| 2 | 112927 | |
| 6 | 79315 | 5.6% |
| 0 | 76302 | 5.4% |
| . | 64856 | 4.6% |
| 7 | 63754 | 4.5% |
| 3 | 54206 | 3.8% |
| Other values (27) | 281932 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2207464 | |
| Punctuation | 37 | < 0.1% |
| None | 4 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 217306 | 9.8% |
| 184367 | 8.4% | |
| 9 | 146678 | 6.6% |
| - | 127695 | 5.8% |
| 2 | 112927 | 5.1% |
| t | 105528 | 4.8% |
| I | 88868 | 4.0% |
| 6 | 79315 | 3.6% |
| 0 | 76302 | 3.5% |
| . | 64856 | 2.9% |
| Other values (77) | 1003622 |
Punctuation
| Value | Count | Frequency (%) |
| – | 36 | |
| … | 1 | 2.7% |
None
| Value | Count | Frequency (%) |
| û | 2 | |
| Ç | 1 | |
| ÿ | 1 |
habitat
Text
Missing 
| Distinct | 89 |
|---|---|
| Distinct (%) | 44.7% |
| Missing | 604427 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 103 |
|---|---|
| Median length | 43 |
| Mean length | 19.28643216 |
| Min length | 5 |
Unique
| Unique | 64 ? |
|---|---|
| Unique (%) | 32.2% |
Sample
| 1st row | Roadside in coniferous forest |
|---|---|
| 2nd row | On a figleaf gourd |
| 3rd row | cultivated garden |
| 4th row | hammocks-dense hardwood & Palmetto forests |
| 5th row | visiting mango flowers |
| Value | Count | Frequency (%) |
| garden | 45 | 7.4% |
| cultivated | 44 | 7.3% |
| stream | 26 | 4.3% |
| on | 26 | 4.3% |
| forest | 23 | 3.8% |
| in | 19 | 3.1% |
| of | 13 | 2.1% |
| collected | 12 | 2.0% |
| at | 9 | 1.5% |
| terre | 8 | 1.3% |
| Other values (183) | 381 |
Most occurring characters
| Value | Count | Frequency (%) |
| 407 | 10.6% | |
| e | 388 | 10.1% |
| a | 308 | 8.0% |
| r | 258 | 6.7% |
| t | 250 | 6.5% |
| d | 224 | 5.8% |
| n | 223 | 5.8% |
| o | 217 | 5.7% |
| i | 190 | 5.0% |
| l | 185 | 4.8% |
| Other values (52) | 1188 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3215 | |
| Space Separator | 407 | 10.6% |
| Uppercase Letter | 126 | 3.3% |
| Other Punctuation | 51 | 1.3% |
| Decimal Number | 27 | 0.7% |
| Dash Punctuation | 6 | 0.2% |
| Close Punctuation | 3 | 0.1% |
| Open Punctuation | 3 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 388 | |
| a | 308 | 9.6% |
| r | 258 | 8.0% |
| t | 250 | 7.8% |
| d | 224 | 7.0% |
| n | 223 | 6.9% |
| o | 217 | 6.7% |
| i | 190 | 5.9% |
| l | 185 | 5.8% |
| s | 175 | 5.4% |
| Other values (15) | 797 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 28 | |
| C | 24 | |
| R | 9 | 7.1% |
| O | 9 | 7.1% |
| P | 8 | 6.3% |
| T | 7 | 5.6% |
| I | 6 | 4.8% |
| W | 5 | 4.0% |
| F | 5 | 4.0% |
| E | 4 | 3.2% |
| Other values (10) | 21 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8 | |
| 2 | 6 | |
| 1 | 5 | |
| 3 | 4 | |
| 8 | 2 | 7.4% |
| 5 | 1 | 3.7% |
| 7 | 1 | 3.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 19 | |
| . | 16 | |
| " | 6 | 11.8% |
| : | 5 | 9.8% |
| & | 3 | 5.9% |
| / | 2 | 3.9% |
Space Separator
| Value | Count | Frequency (%) |
| 407 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3341 | |
| Common | 497 | 12.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 388 | |
| a | 308 | 9.2% |
| r | 258 | 7.7% |
| t | 250 | 7.5% |
| d | 224 | 6.7% |
| n | 223 | 6.7% |
| o | 217 | 6.5% |
| i | 190 | 5.7% |
| l | 185 | 5.5% |
| s | 175 | 5.2% |
| Other values (35) | 923 |
Common
| Value | Count | Frequency (%) |
| 407 | ||
| , | 19 | 3.8% |
| . | 16 | 3.2% |
| 0 | 8 | 1.6% |
| " | 6 | 1.2% |
| 2 | 6 | 1.2% |
| - | 6 | 1.2% |
| 1 | 5 | 1.0% |
| : | 5 | 1.0% |
| 3 | 4 | 0.8% |
| Other values (7) | 15 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3838 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 407 | 10.6% | |
| e | 388 | 10.1% |
| a | 308 | 8.0% |
| r | 258 | 6.7% |
| t | 250 | 6.5% |
| d | 224 | 5.8% |
| n | 223 | 5.8% |
| o | 217 | 5.7% |
| i | 190 | 5.0% |
| l | 185 | 4.8% |
| Other values (52) | 1188 |
locationID
Text
Missing 
| Distinct | 185 |
|---|---|
| Distinct (%) | 17.7% |
| Missing | 603581 |
| Missing (%) | 99.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 14 |
| Mean length | 10.78947368 |
| Min length | 1 |
Unique
| Unique | 94 ? |
|---|---|
| Unique (%) | 9.0% |
Sample
| 1st row | MEI Site 97-81 |
|---|---|
| 2nd row | RD-044 |
| 3rd row | MEI Site 97-81 |
| 4th row | MEI Site 97-81 |
| 5th row | MEI Site 97-81 |
| Value | Count | Frequency (%) |
| mei | 652 | |
| site | 610 | |
| 97-81 | 301 | |
| 97-92 | 132 | 5.6% |
| 97-90 | 52 | 2.2% |
| 97-58 | 46 | 1.9% |
| 97-74 | 31 | 1.3% |
| 97-88 | 26 | 1.1% |
| 97-93 | 24 | 1.0% |
| k-m1 | 19 | 0.8% |
| Other values (195) | 479 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1327 | 11.8% | |
| - | 986 | 8.7% |
| 9 | 904 | 8.0% |
| 7 | 770 | 6.8% |
| M | 698 | 6.2% |
| I | 659 | 5.8% |
| E | 656 | 5.8% |
| t | 638 | 5.7% |
| e | 637 | 5.6% |
| i | 624 | 5.5% |
| Other values (46) | 3376 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3620 | |
| Uppercase Letter | 3287 | |
| Lowercase Letter | 2029 | |
| Space Separator | 1327 | 11.8% |
| Dash Punctuation | 986 | 8.7% |
| Other Punctuation | 26 | 0.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 698 | |
| I | 659 | |
| E | 656 | |
| S | 609 | |
| R | 278 | 8.5% |
| D | 272 | 8.3% |
| K | 20 | 0.6% |
| J | 14 | 0.4% |
| N | 11 | 0.3% |
| L | 11 | 0.3% |
| Other values (11) | 59 | 1.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 638 | |
| e | 637 | |
| i | 624 | |
| l | 27 | 1.3% |
| a | 20 | 1.0% |
| s | 20 | 1.0% |
| r | 10 | 0.5% |
| o | 8 | 0.4% |
| n | 7 | 0.3% |
| p | 7 | 0.3% |
| Other values (9) | 31 | 1.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 904 | |
| 7 | 770 | |
| 1 | 571 | |
| 8 | 458 | |
| 2 | 322 | 8.9% |
| 0 | 184 | 5.1% |
| 5 | 143 | 4.0% |
| 4 | 95 | 2.6% |
| 6 | 87 | 2.4% |
| 3 | 86 | 2.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| # | 19 | |
| , | 5 | 19.2% |
| . | 1 | 3.8% |
| : | 1 | 3.8% |
Space Separator
| Value | Count | Frequency (%) |
| 1327 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 986 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5959 | |
| Latin | 5316 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 698 | |
| I | 659 | |
| E | 656 | |
| t | 638 | |
| e | 637 | |
| i | 624 | |
| S | 609 | |
| R | 278 | 5.2% |
| D | 272 | 5.1% |
| l | 27 | 0.5% |
| Other values (30) | 218 | 4.1% |
Common
| Value | Count | Frequency (%) |
| 1327 | ||
| - | 986 | |
| 9 | 904 | |
| 7 | 770 | |
| 1 | 571 | |
| 8 | 458 | 7.7% |
| 2 | 322 | 5.4% |
| 0 | 184 | 3.1% |
| 5 | 143 | 2.4% |
| 4 | 95 | 1.6% |
| Other values (6) | 199 | 3.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11275 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1327 | 11.8% | |
| - | 986 | 8.7% |
| 9 | 904 | 8.0% |
| 7 | 770 | 6.8% |
| M | 698 | 6.2% |
| I | 659 | 5.8% |
| E | 656 | 5.8% |
| t | 638 | 5.7% |
| e | 637 | 5.6% |
| i | 624 | 5.5% |
| Other values (46) | 3376 |
higherGeography
Text
Missing 
| Distinct | 10596 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 156072 |
| Missing (%) | 25.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 101 |
|---|---|
| Median length | 91 |
| Mean length | 30.3893578 |
| Min length | 4 |
Unique
| Unique | 3142 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | United States, [Not Stated], [Not Stated] |
|---|---|
| 2nd row | Costa Rica, Cartago, [Not Stated] |
| 3rd row | United States, Alaska, Aleutians West |
| 4th row | United States, Virginia, Virginia Beach |
| 5th row | United States, New York, [Not Stated] |
| Value | Count | Frequency (%) |
| united | 222825 | 12.1% |
| states | 221093 | 12.1% |
| not | 167986 | 9.2% |
| stated | 167984 | 9.2% |
| california | 23408 | 1.3% |
| virginia | 23318 | 1.3% |
| new | 22501 | 1.2% |
| colorado | 21080 | 1.1% |
| mexico | 21000 | 1.1% |
| canada | 16228 | 0.9% |
| Other values (6796) | 927046 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1386625 | 10.2% |
| t | 1386616 | 10.2% |
| 1385915 | 10.2% | |
| e | 1090858 | 8.0% |
| i | 815973 | 6.0% |
| n | 814117 | 6.0% |
| , | 798806 | 5.9% |
| o | 692454 | 5.1% |
| d | 580356 | 4.3% |
| s | 501626 | 3.7% |
| Other values (122) | 4177922 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9267024 | |
| Uppercase Letter | 1826318 | 13.4% |
| Space Separator | 1385915 | 10.2% |
| Other Punctuation | 805648 | 5.9% |
| Open Punctuation | 168013 | 1.2% |
| Close Punctuation | 167964 | 1.2% |
| Dash Punctuation | 10307 | 0.1% |
| Decimal Number | 75 | < 0.1% |
| Modifier Letter | 2 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1386625 | |
| t | 1386616 | |
| e | 1090858 | |
| i | 815973 | |
| n | 814117 | |
| o | 692454 | |
| d | 580356 | |
| s | 501626 | 5.4% |
| r | 454233 | 4.9% |
| l | 313851 | 3.4% |
| Other values (59) | 1230315 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 462242 | |
| U | 242069 | |
| N | 220708 | |
| C | 174669 | 9.6% |
| M | 92421 | 5.1% |
| P | 64234 | 3.5% |
| B | 57594 | 3.2% |
| A | 54173 | 3.0% |
| T | 52081 | 2.9% |
| I | 45075 | 2.5% |
| Other values (27) | 361052 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 798806 | |
| ' | 3983 | 0.5% |
| . | 2433 | 0.3% |
| / | 183 | < 0.1% |
| ? | 152 | < 0.1% |
| & | 50 | < 0.1% |
| : | 39 | < 0.1% |
| ; | 1 | < 0.1% |
| ¡ | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 46 | |
| 9 | 14 | 18.7% |
| 4 | 11 | 14.7% |
| 2 | 2 | 2.7% |
| 8 | 1 | 1.3% |
| 1 | 1 | 1.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 10283 | |
| – | 22 | 0.2% |
| — | 2 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 167979 | |
| ( | 34 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 167930 | |
| ) | 34 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1385915 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 2 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1 |
Control
| Value | Count | Frequency (%) |
| | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11093342 | |
| Common | 2537926 | 18.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1386625 | |
| t | 1386616 | |
| e | 1090858 | 9.8% |
| i | 815973 | 7.4% |
| n | 814117 | 7.3% |
| o | 692454 | 6.2% |
| d | 580356 | 5.2% |
| s | 501626 | 4.5% |
| S | 462242 | 4.2% |
| r | 454233 | 4.1% |
| Other values (96) | 2908242 |
Common
| Value | Count | Frequency (%) |
| 1385915 | ||
| , | 798806 | |
| [ | 167979 | 6.6% |
| ] | 167930 | 6.6% |
| - | 10283 | 0.4% |
| ' | 3983 | 0.2% |
| . | 2433 | 0.1% |
| / | 183 | < 0.1% |
| ? | 152 | < 0.1% |
| & | 50 | < 0.1% |
| Other values (16) | 212 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13624976 | |
| None | 6244 | < 0.1% |
| Punctuation | 24 | < 0.1% |
| Latin Ext Additional | 22 | < 0.1% |
| Modifier Letters | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1386625 | 10.2% |
| t | 1386616 | 10.2% |
| 1385915 | 10.2% | |
| e | 1090858 | 8.0% |
| i | 815973 | 6.0% |
| n | 814117 | 6.0% |
| , | 798806 | 5.9% |
| o | 692454 | 5.1% |
| d | 580356 | 4.3% |
| s | 501626 | 3.7% |
| Other values (63) | 4171630 |
None
| Value | Count | Frequency (%) |
| á | 1227 | |
| ü | 1113 | |
| í | 1027 | |
| ó | 731 | |
| é | 700 | |
| ã | 292 | 4.7% |
| ô | 268 | 4.3% |
| ø | 167 | 2.7% |
| è | 135 | 2.2% |
| ä | 68 | 1.1% |
| Other values (45) | 516 |
Punctuation
| Value | Count | Frequency (%) |
| – | 22 | |
| — | 2 | 8.3% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ị | 22 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 2 |
continent
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 199137 |
| Missing (%) | 32.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 11.12657803 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | NORTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 259896 | |
| asia | 50862 | 12.5% |
| south_america | 49534 | 12.2% |
| africa | 21692 | 5.3% |
| oceania | 14473 | 3.6% |
| europe | 9029 | 2.2% |
| antarctica | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 792923 | |
| R | 600050 | |
| I | 396460 | |
| C | 345601 | |
| E | 341961 | |
| O | 332932 | |
| T | 309436 | 6.9% |
| H | 309430 | 6.9% |
| _ | 309430 | 6.9% |
| M | 309430 | 6.9% |
| Other values (5) | 464052 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4202275 | |
| Connector Punctuation | 309430 | 6.9% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 792923 | |
| R | 600050 | |
| I | 396460 | |
| C | 345601 | |
| E | 341961 | |
| O | 332932 | |
| T | 309436 | 7.4% |
| H | 309430 | 7.4% |
| M | 309430 | 7.4% |
| N | 274372 | 6.5% |
| Other values (4) | 189680 | 4.5% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 309430 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4202275 | |
| Common | 309430 | 6.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 792923 | |
| R | 600050 | |
| I | 396460 | |
| C | 345601 | |
| E | 341961 | |
| O | 332932 | |
| T | 309436 | 7.4% |
| H | 309430 | 7.4% |
| M | 309430 | 7.4% |
| N | 274372 | 6.5% |
| Other values (4) | 189680 | 4.5% |
Common
| Value | Count | Frequency (%) |
| _ | 309430 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4511705 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 792923 | |
| R | 600050 | |
| I | 396460 | |
| C | 345601 | |
| E | 341961 | |
| O | 332932 | |
| T | 309436 | 6.9% |
| H | 309430 | 6.9% |
| _ | 309430 | 6.9% |
| M | 309430 | 6.9% |
| Other values (5) | 464052 |
islandGroup
Text
Missing 
| Distinct | 72 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 602107 |
| Missing (%) | 99.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 13 |
| Mean length | 13.72052402 |
| Min length | 5 |
Unique
| Unique | 21 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | Sunda Islands |
|---|---|
| 2nd row | Inner Islands |
| 3rd row | Viti Levu Group |
| 4th row | Chuuk Lagoon |
| 5th row | Sunda Islands |
| Value | Count | Frequency (%) |
| islands | 2159 | |
| sunda | 955 | |
| marquesas | 249 | 4.9% |
| solomon | 226 | 4.4% |
| bass | 171 | 3.3% |
| chuuk | 149 | 2.9% |
| lagoon | 149 | 2.9% |
| outer | 149 | 2.9% |
| inner | 140 | 2.7% |
| group | 100 | 2.0% |
| Other values (78) | 673 | 13.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 5363 | |
| a | 4393 | |
| n | 3946 | |
| d | 3264 | |
| 2601 | ||
| l | 2567 | |
| I | 2312 | |
| u | 1952 | 5.6% |
| S | 1249 | 3.6% |
| o | 1226 | 3.5% |
| Other values (39) | 5689 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 26822 | |
| Uppercase Letter | 5120 | 14.8% |
| Space Separator | 2601 | 7.5% |
| Other Punctuation | 19 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 5363 | |
| a | 4393 | |
| n | 3946 | |
| d | 3264 | |
| l | 2567 | |
| u | 1952 | 7.3% |
| o | 1226 | 4.6% |
| r | 905 | 3.4% |
| e | 893 | 3.3% |
| i | 343 | 1.3% |
| Other values (14) | 1970 | 7.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 2312 | |
| S | 1249 | |
| M | 256 | 5.0% |
| L | 237 | 4.6% |
| C | 200 | 3.9% |
| B | 171 | 3.3% |
| O | 158 | 3.1% |
| G | 147 | 2.9% |
| V | 87 | 1.7% |
| N | 75 | 1.5% |
| Other values (12) | 228 | 4.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 10 | |
| . | 9 |
Space Separator
| Value | Count | Frequency (%) |
| 2601 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 31942 | |
| Common | 2620 | 7.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 5363 | |
| a | 4393 | |
| n | 3946 | |
| d | 3264 | |
| l | 2567 | |
| I | 2312 | |
| u | 1952 | 6.1% |
| S | 1249 | 3.9% |
| o | 1226 | 3.8% |
| r | 905 | 2.8% |
| Other values (36) | 4765 |
Common
| Value | Count | Frequency (%) |
| 2601 | ||
| ' | 10 | 0.4% |
| . | 9 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 34562 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 5363 | |
| a | 4393 | |
| n | 3946 | |
| d | 3264 | |
| 2601 | ||
| l | 2567 | |
| I | 2312 | |
| u | 1952 | 5.6% |
| S | 1249 | 3.6% |
| o | 1226 | 3.5% |
| Other values (39) | 5689 |
island
Text
Missing 
| Distinct | 436 |
|---|---|
| Distinct (%) | 4.7% |
| Missing | 595261 |
| Missing (%) | 98.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 21 |
| Mean length | 9.325680726 |
| Min length | 3 |
Unique
| Unique | 168 ? |
|---|---|
| Unique (%) | 1.8% |
Sample
| 1st row | South Island |
|---|---|
| 2nd row | Pohnpei |
| 3rd row | South Island |
| 4th row | Oahu |
| 5th row | Guadalcanal |
| Value | Count | Frequency (%) |
| island | 3167 | |
| south | 1636 | 11.1% |
| java | 883 | 6.0% |
| levu | 565 | 3.8% |
| viti | 541 | 3.7% |
| north | 519 | 3.5% |
| guadalcanal | 327 | 2.2% |
| borneo | 253 | 1.7% |
| hiva | 247 | 1.7% |
| key | 246 | 1.7% |
| Other values (438) | 6371 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 12931 | |
| n | 6143 | 7.0% |
| l | 5485 | 6.3% |
| o | 5446 | 6.2% |
| 5390 | 6.2% | |
| u | 4466 | 5.1% |
| d | 4450 | 5.1% |
| s | 4126 | 4.7% |
| e | 3908 | 4.5% |
| t | 3745 | 4.3% |
| Other values (52) | 31245 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 66993 | |
| Uppercase Letter | 14738 | 16.9% |
| Space Separator | 5390 | 6.2% |
| Other Punctuation | 169 | 0.2% |
| Dash Punctuation | 18 | < 0.1% |
| Open Punctuation | 13 | < 0.1% |
| Close Punctuation | 13 | < 0.1% |
| Modifier Letter | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 12931 | |
| n | 6143 | |
| l | 5485 | |
| o | 5446 | |
| u | 4466 | 6.7% |
| d | 4450 | 6.6% |
| s | 4126 | 6.2% |
| e | 3908 | 5.8% |
| t | 3745 | 5.6% |
| i | 3651 | 5.4% |
| Other values (19) | 12642 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 3295 | |
| S | 2358 | |
| N | 1067 | 7.2% |
| J | 891 | 6.0% |
| L | 820 | 5.6% |
| B | 722 | 4.9% |
| V | 681 | 4.6% |
| G | 648 | 4.4% |
| M | 648 | 4.4% |
| H | 619 | 4.2% |
| Other values (14) | 2989 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 164 | |
| . | 5 | 3.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 12 | |
| [ | 1 | 7.7% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 12 | |
| ] | 1 | 7.7% |
Space Separator
| Value | Count | Frequency (%) |
| 5390 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 18 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 81731 | |
| Common | 5604 | 6.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 12931 | |
| n | 6143 | 7.5% |
| l | 5485 | 6.7% |
| o | 5446 | 6.7% |
| u | 4466 | 5.5% |
| d | 4450 | 5.4% |
| s | 4126 | 5.0% |
| e | 3908 | 4.8% |
| t | 3745 | 4.6% |
| i | 3651 | 4.5% |
| Other values (43) | 27380 |
Common
| Value | Count | Frequency (%) |
| 5390 | ||
| ' | 164 | 2.9% |
| - | 18 | 0.3% |
| ( | 12 | 0.2% |
| ) | 12 | 0.2% |
| . | 5 | 0.1% |
| ʻ | 1 | < 0.1% |
| [ | 1 | < 0.1% |
| ] | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 87309 | |
| None | 25 | < 0.1% |
| Modifier Letters | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 12931 | |
| n | 6143 | 7.0% |
| l | 5485 | 6.3% |
| o | 5446 | 6.2% |
| 5390 | 6.2% | |
| u | 4466 | 5.1% |
| d | 4450 | 5.1% |
| s | 4126 | 4.7% |
| e | 3908 | 4.5% |
| t | 3745 | 4.3% |
| Other values (47) | 31219 |
None
| Value | Count | Frequency (%) |
| ñ | 13 | |
| ó | 7 | |
| é | 4 | 16.0% |
| Ž | 1 | 4.0% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 1 |
countryCode
Text
Missing 
| Distinct | 217 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 163440 |
| Missing (%) | 27.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | US |
|---|---|
| 2nd row | CR |
| 3rd row | US |
| 4th row | US |
| 5th row | US |
| Value | Count | Frequency (%) |
| us | 217888 | |
| ca | 16227 | 3.7% |
| mx | 15807 | 3.6% |
| cn | 14551 | 3.3% |
| br | 12970 | 2.9% |
| cr | 8902 | 2.0% |
| pe | 7635 | 1.7% |
| in | 7034 | 1.6% |
| ph | 6836 | 1.5% |
| pa | 6325 | 1.4% |
| Other values (207) | 127011 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 226074 | |
| S | 225013 | |
| C | 60558 | 6.9% |
| A | 35637 | 4.0% |
| P | 33561 | 3.8% |
| R | 32631 | 3.7% |
| N | 30863 | 3.5% |
| M | 28480 | 3.2% |
| E | 27423 | 3.1% |
| B | 22245 | 2.5% |
| Other values (16) | 159887 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 882372 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 226074 | |
| S | 225013 | |
| C | 60558 | 6.9% |
| A | 35637 | 4.0% |
| P | 33561 | 3.8% |
| R | 32631 | 3.7% |
| N | 30863 | 3.5% |
| M | 28480 | 3.2% |
| E | 27423 | 3.1% |
| B | 22245 | 2.5% |
| Other values (16) | 159887 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 882372 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 226074 | |
| S | 225013 | |
| C | 60558 | 6.9% |
| A | 35637 | 4.0% |
| P | 33561 | 3.8% |
| R | 32631 | 3.7% |
| N | 30863 | 3.5% |
| M | 28480 | 3.2% |
| E | 27423 | 3.1% |
| B | 22245 | 2.5% |
| Other values (16) | 159887 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 882372 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 226074 | |
| S | 225013 | |
| C | 60558 | 6.9% |
| A | 35637 | 4.0% |
| P | 33561 | 3.8% |
| R | 32631 | 3.7% |
| N | 30863 | 3.5% |
| M | 28480 | 3.2% |
| E | 27423 | 3.1% |
| B | 22245 | 2.5% |
| Other values (16) | 159887 |
stateProvince
Text
Missing 
| Distinct | 3068 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 173217 |
| Missing (%) | 28.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 57 |
|---|---|
| Median length | 44 |
| Mean length | 9.044864618 |
| Min length | 2 |
Unique
| Unique | 808 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | [Not Stated] |
|---|---|
| 2nd row | Cartago |
| 3rd row | Alaska |
| 4th row | Virginia |
| 5th row | New York |
| Value | Count | Frequency (%) |
| not | 29432 | 5.2% |
| stated | 29432 | 5.2% |
| california | 23319 | 4.1% |
| virginia | 22011 | 3.9% |
| colorado | 20952 | 3.7% |
| new | 16649 | 2.9% |
| texas | 12340 | 2.2% |
| arizona | 12144 | 2.1% |
| florida | 9882 | 1.7% |
| maryland | 9606 | 1.7% |
| Other values (2915) | 379808 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 524269 | 13.4% |
| o | 333137 | 8.5% |
| i | 321738 | 8.2% |
| n | 299056 | 7.7% |
| r | 250043 | 6.4% |
| e | 216664 | 5.6% |
| t | 208608 | 5.3% |
| s | 151897 | 3.9% |
| l | 138272 | 3.5% |
| 134166 | 3.4% | |
| Other values (106) | 1324186 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3135181 | |
| Uppercase Letter | 563753 | 14.4% |
| Space Separator | 134166 | 3.4% |
| Open Punctuation | 29401 | 0.8% |
| Close Punctuation | 29392 | 0.8% |
| Dash Punctuation | 8108 | 0.2% |
| Other Punctuation | 1958 | 0.1% |
| Decimal Number | 75 | < 0.1% |
| Modifier Letter | 1 | < 0.1% |
| Control | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 524269 | |
| o | 333137 | |
| i | 321738 | |
| n | 299056 | |
| r | 250043 | |
| e | 216664 | 6.9% |
| t | 208608 | 6.7% |
| s | 151897 | 4.8% |
| l | 138272 | 4.4% |
| d | 112993 | 3.6% |
| Other values (49) | 578504 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 79646 | |
| N | 67131 | |
| S | 61370 | |
| M | 46167 | 8.2% |
| T | 31296 | 5.6% |
| A | 30285 | 5.4% |
| V | 29054 | 5.2% |
| W | 27111 | 4.8% |
| P | 20324 | 3.6% |
| I | 18301 | 3.2% |
| Other values (25) | 153068 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 987 | |
| ' | 638 | |
| ? | 138 | 7.0% |
| / | 121 | 6.2% |
| , | 70 | 3.6% |
| : | 3 | 0.2% |
| ¡ | 1 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 46 | |
| 9 | 14 | 18.7% |
| 4 | 11 | 14.7% |
| 2 | 2 | 2.7% |
| 8 | 1 | 1.3% |
| 1 | 1 | 1.3% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 29400 | |
| ( | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 29391 | |
| ) | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8086 | |
| – | 22 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 134166 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 1 |
Control
| Value | Count | Frequency (%) |
| | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3698934 | |
| Common | 203102 | 5.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 524269 | |
| o | 333137 | 9.0% |
| i | 321738 | 8.7% |
| n | 299056 | 8.1% |
| r | 250043 | 6.8% |
| e | 216664 | 5.9% |
| t | 208608 | 5.6% |
| s | 151897 | 4.1% |
| l | 138272 | 3.7% |
| d | 112993 | 3.1% |
| Other values (84) | 1142257 |
Common
| Value | Count | Frequency (%) |
| 134166 | ||
| [ | 29400 | 14.5% |
| ] | 29391 | 14.5% |
| - | 8086 | 4.0% |
| . | 987 | 0.5% |
| ' | 638 | 0.3% |
| ? | 138 | 0.1% |
| / | 121 | 0.1% |
| , | 70 | < 0.1% |
| 3 | 46 | < 0.1% |
| Other values (12) | 59 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3896893 | |
| None | 5098 | 0.1% |
| Latin Ext Additional | 22 | < 0.1% |
| Punctuation | 22 | < 0.1% |
| Modifier Letters | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 524269 | 13.5% |
| o | 333137 | 8.5% |
| i | 321738 | 8.3% |
| n | 299056 | 7.7% |
| r | 250043 | 6.4% |
| e | 216664 | 5.6% |
| t | 208608 | 5.4% |
| s | 151897 | 3.9% |
| l | 138272 | 3.5% |
| 134166 | 3.4% | |
| Other values (60) | 1319043 |
None
| Value | Count | Frequency (%) |
| á | 1200 | |
| ü | 990 | |
| í | 928 | |
| ó | 488 | |
| é | 410 | 8.0% |
| ã | 292 | 5.7% |
| ø | 158 | 3.1% |
| ô | 125 | 2.5% |
| è | 117 | 2.3% |
| ä | 54 | 1.1% |
| Other values (33) | 336 | 6.6% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ị | 22 |
Punctuation
| Value | Count | Frequency (%) |
| – | 22 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 1 |
county
Text
Missing 
| Distinct | 4068 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 254826 |
| Missing (%) | 42.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 51 |
|---|---|
| Median length | 45 |
| Mean length | 9.456223556 |
| Min length | 1 |
Unique
| Unique | 1157 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | [Not Stated] |
|---|---|
| 2nd row | [Not Stated] |
| 3rd row | Aleutians West |
| 4th row | Virginia Beach |
| 5th row | [Not Stated] |
| Value | Count | Frequency (%) |
| not | 132036 | |
| stated | 132034 | |
| boulder | 6789 | 1.3% |
| creek | 6760 | 1.3% |
| clear | 6751 | 1.3% |
| san | 5404 | 1.0% |
| montgomery | 4939 | 0.9% |
| cochise | 4320 | 0.8% |
| prince | 3491 | 0.7% |
| tuolumne | 3205 | 0.6% |
| Other values (4079) | 215253 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 455384 | |
| a | 309851 | 9.4% |
| e | 305684 | 9.2% |
| o | 264700 | 8.0% |
| 171182 | 5.2% | |
| d | 169196 | 5.1% |
| S | 152102 | 4.6% |
| N | 137663 | 4.2% |
| n | 133833 | 4.0% |
| [ | 132054 | 4.0% |
| Other values (88) | 1076138 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2346650 | |
| Uppercase Letter | 519082 | 15.7% |
| Space Separator | 171182 | 5.2% |
| Open Punctuation | 132072 | 4.0% |
| Close Punctuation | 132032 | 4.0% |
| Other Punctuation | 4600 | 0.1% |
| Dash Punctuation | 2168 | 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 455384 | |
| a | 309851 | |
| e | 305684 | |
| o | 264700 | |
| d | 169196 | 7.2% |
| n | 133833 | 5.7% |
| r | 128718 | 5.5% |
| i | 96626 | 4.1% |
| l | 92683 | 3.9% |
| s | 72029 | 3.1% |
| Other values (42) | 317946 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 152102 | |
| N | 137663 | |
| C | 39705 | 7.6% |
| B | 24568 | 4.7% |
| M | 21490 | 4.1% |
| P | 16764 | 3.2% |
| W | 13595 | 2.6% |
| L | 12293 | 2.4% |
| G | 12064 | 2.3% |
| T | 10761 | 2.1% |
| Other values (23) | 78077 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 3064 | |
| . | 1321 | |
| , | 105 | 2.3% |
| / | 56 | 1.2% |
| & | 50 | 1.1% |
| ? | 4 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 132054 | |
| ( | 18 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 132014 | |
| ) | 18 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 171182 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2168 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2865732 | |
| Common | 442055 | 13.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 455384 | |
| a | 309851 | |
| e | 305684 | |
| o | 264700 | 9.2% |
| d | 169196 | 5.9% |
| S | 152102 | 5.3% |
| N | 137663 | 4.8% |
| n | 133833 | 4.7% |
| r | 128718 | 4.5% |
| i | 96626 | 3.4% |
| Other values (75) | 711975 |
Common
| Value | Count | Frequency (%) |
| 171182 | ||
| [ | 132054 | |
| ] | 132014 | |
| ' | 3064 | 0.7% |
| - | 2168 | 0.5% |
| . | 1321 | 0.3% |
| , | 105 | < 0.1% |
| / | 56 | < 0.1% |
| & | 50 | < 0.1% |
| ( | 18 | < 0.1% |
| Other values (3) | 23 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3306740 | |
| None | 1047 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 455384 | |
| a | 309851 | 9.4% |
| e | 305684 | 9.2% |
| o | 264700 | 8.0% |
| 171182 | 5.2% | |
| d | 169196 | 5.1% |
| S | 152102 | 4.6% |
| N | 137663 | 4.2% |
| n | 133833 | 4.0% |
| [ | 132054 | 4.0% |
| Other values (55) | 1075091 |
None
| Value | Count | Frequency (%) |
| é | 285 | |
| ó | 235 | |
| ü | 123 | |
| í | 99 | 9.5% |
| ô | 74 | 7.1% |
| Ñ | 29 | 2.8% |
| á | 27 | 2.6% |
| è | 18 | 1.7% |
| ś | 16 | 1.5% |
| ć | 15 | 1.4% |
| Other values (23) | 126 |
locality
Text
Missing 
| Distinct | 76610 |
|---|---|
| Distinct (%) | 17.2% |
| Missing | 158340 |
| Missing (%) | 26.2% |
| Memory size | 4.6 MiB |
Length
| Max length | 400600 |
|---|---|
| Median length | 180 |
| Mean length | 23.74718454 |
| Min length | 1 |
Unique
| Unique | 44457 ? |
|---|---|
| Unique (%) | 10.0% |
Sample
| 1st row | [Not Stated] |
|---|---|
| 2nd row | Rio Aquiares, Turrialba |
| 3rd row | Saint Paul Island, Bering Sea |
| 4th row | False Cape State Park, Wash Woods, 100 meters east of Interpreter's residence |
| 5th row | [Not Stated] |
| Value | Count | Frequency (%) |
| not | 65922 | 4.1% |
| stated | 65846 | 4.1% |
| of | 42709 | 2.7% |
| miles | 21197 | 1.3% |
| kilometers | 15776 | 1.0% |
| park | 15452 | 1.0% |
| river | 15349 | 1.0% |
| lake | 14837 | 0.9% |
| near | 12849 | 0.8% |
| creek | 12664 | 0.8% |
| Other values (56182) | 1322830 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1107495 | 10.5% | |
| a | 957781 | 9.0% |
| e | 764045 | 7.2% |
| o | 666061 | 6.3% |
| t | 632687 | 6.0% |
| n | 514072 | 4.9% |
| i | 493725 | 4.7% |
| r | 484811 | 4.6% |
| l | 394187 | 3.7% |
| s | 365079 | 3.4% |
| Other values (139) | 4218093 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7051212 | |
| Uppercase Letter | 1363576 | 12.9% |
| Space Separator | 1107495 | 10.5% |
| Decimal Number | 378240 | 3.6% |
| Other Punctuation | 330337 | 3.1% |
| Control | 166947 | 1.6% |
| Open Punctuation | 78089 | 0.7% |
| Close Punctuation | 78075 | 0.7% |
| Dash Punctuation | 31559 | 0.3% |
| Connector Punctuation | 11549 | 0.1% |
| Other values (6) | 957 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 957781 | |
| e | 764045 | |
| o | 666061 | |
| t | 632687 | |
| n | 514072 | 7.3% |
| i | 493725 | 7.0% |
| r | 484811 | 6.9% |
| l | 394187 | 5.6% |
| s | 365079 | 5.2% |
| u | 257241 | 3.6% |
| Other values (49) | 1521523 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 179234 | |
| C | 131330 | 9.6% |
| N | 130256 | 9.6% |
| R | 94760 | 6.9% |
| P | 94322 | 6.9% |
| M | 86675 | 6.4% |
| B | 66017 | 4.8% |
| L | 59567 | 4.4% |
| A | 56700 | 4.2% |
| T | 52375 | 3.8% |
| Other values (30) | 412340 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 154021 | |
| . | 80280 | |
| ; | 58000 | 17.6% |
| : | 17454 | 5.3% |
| ' | 9316 | 2.8% |
| / | 6838 | 2.1% |
| " | 1470 | 0.4% |
| ? | 1401 | 0.4% |
| & | 867 | 0.3% |
| # | 653 | 0.2% |
| Other values (5) | 37 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 67501 | |
| 0 | 62313 | |
| 2 | 51275 | |
| 5 | 35588 | |
| 3 | 34065 | |
| 4 | 32368 | |
| 6 | 26433 | 7.0% |
| 8 | 24052 | 6.4% |
| 9 | 22778 | 6.0% |
| 7 | 21867 | 5.8% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 299 | |
| ~ | 250 | |
| = | 121 | |
| | | 85 | 11.2% |
| < | 2 | 0.3% |
| > | 1 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 69831 | |
| ( | 8154 | 10.4% |
| { | 103 | 0.1% |
| ‚ | 1 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| 166196 | ||
| 749 | 0.4% | |
| | 2 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 69787 | |
| ) | 8145 | 10.4% |
| } | 143 | 0.2% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 3 | |
| ¯ | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 1107495 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 31559 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 11549 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 112 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 50 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 26 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8414788 | |
| Common | 2183248 | 20.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 957781 | 11.4% |
| e | 764045 | 9.1% |
| o | 666061 | 7.9% |
| t | 632687 | 7.5% |
| n | 514072 | 6.1% |
| i | 493725 | 5.9% |
| r | 484811 | 5.8% |
| l | 394187 | 4.7% |
| s | 365079 | 4.3% |
| u | 257241 | 3.1% |
| Other values (89) | 2885099 |
Common
| Value | Count | Frequency (%) |
| 1107495 | ||
| 166196 | 7.6% | |
| , | 154021 | 7.1% |
| . | 80280 | 3.7% |
| [ | 69831 | 3.2% |
| ] | 69787 | 3.2% |
| 1 | 67501 | 3.1% |
| 0 | 62313 | 2.9% |
| ; | 58000 | 2.7% |
| 2 | 51275 | 2.3% |
| Other values (40) | 296549 | 13.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10595457 | |
| None | 2545 | < 0.1% |
| Punctuation | 34 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1107495 | 10.5% | |
| a | 957781 | 9.0% |
| e | 764045 | 7.2% |
| o | 666061 | 6.3% |
| t | 632687 | 6.0% |
| n | 514072 | 4.9% |
| i | 493725 | 4.7% |
| r | 484811 | 4.6% |
| l | 394187 | 3.7% |
| s | 365079 | 3.4% |
| Other values (82) | 4215514 |
None
| Value | Count | Frequency (%) |
| ñ | 374 | |
| ó | 352 | |
| é | 344 | |
| á | 340 | |
| ã | 220 | |
| ü | 181 | |
| í | 149 | 5.9% |
| ç | 117 | 4.6% |
| ° | 112 | 4.4% |
| ¢ | 50 | 2.0% |
| Other values (43) | 306 |
Punctuation
| Value | Count | Frequency (%) |
| ” | 26 | |
| “ | 6 | 17.6% |
| ‚ | 1 | 2.9% |
| … | 1 | 2.9% |
Missing 
| Distinct | 1024 |
|---|---|
| Distinct (%) | 10.3% |
| Missing | 594692 |
| Missing (%) | 98.4% |
| Memory size | 4.6 MiB |
Length
| Max length | 94 |
|---|---|
| Median length | 31 |
| Mean length | 8.08838333 |
| Min length | 1 |
Unique
| Unique | 334 ? |
|---|---|
| Unique (%) | 3.4% |
Sample
| 1st row | 140 meters |
|---|---|
| 2nd row | 3900 feet |
| 3rd row | 5940 feet |
| 4th row | 180 meters |
| 5th row | 3000 feet |
| Value | Count | Frequency (%) |
| m | 2782 | 14.5% |
| feet | 2472 | 12.9% |
| meters | 1521 | 7.9% |
| ft | 1465 | 7.6% |
| 1000 | 347 | 1.8% |
| level | 318 | 1.7% |
| sea | 318 | 1.7% |
| 300 | 305 | 1.6% |
| near | 276 | 1.4% |
| 3200 | 236 | 1.2% |
| Other values (619) | 9192 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 16889 | |
| e | 9358 | |
| 9298 | ||
| t | 5738 | 7.1% |
| m | 5102 | 6.3% |
| f | 4103 | 5.1% |
| 1 | 4088 | 5.1% |
| 5 | 3791 | 4.7% |
| 2 | 2912 | 3.6% |
| . | 2458 | 3.1% |
| Other values (44) | 16613 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 36341 | |
| Lowercase Letter | 30893 | |
| Space Separator | 9298 | 11.6% |
| Other Punctuation | 2945 | 3.7% |
| Dash Punctuation | 765 | 1.0% |
| Uppercase Letter | 44 | 0.1% |
| Open Punctuation | 23 | < 0.1% |
| Close Punctuation | 23 | < 0.1% |
| Math Symbol | 18 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 9358 | |
| t | 5738 | |
| m | 5102 | |
| f | 4103 | |
| r | 1891 | 6.1% |
| s | 1851 | 6.0% |
| a | 854 | 2.8% |
| l | 695 | 2.2% |
| n | 346 | 1.1% |
| v | 331 | 1.1% |
| Other values (12) | 624 | 2.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 16889 | |
| 1 | 4088 | 11.2% |
| 5 | 3791 | 10.4% |
| 2 | 2912 | 8.0% |
| 3 | 2121 | 5.8% |
| 4 | 1908 | 5.3% |
| 6 | 1282 | 3.5% |
| 7 | 1249 | 3.4% |
| 8 | 1174 | 3.2% |
| 9 | 927 | 2.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 30 | |
| N | 5 | 11.4% |
| L | 3 | 6.8% |
| A | 2 | 4.5% |
| P | 1 | 2.3% |
| B | 1 | 2.3% |
| S | 1 | 2.3% |
| W | 1 | 2.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2458 | |
| ' | 338 | 11.5% |
| , | 126 | 4.3% |
| & | 13 | 0.4% |
| ? | 9 | 0.3% |
| / | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 22 | |
| [ | 1 | 4.3% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 22 | |
| ] | 1 | 4.3% |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 17 | |
| + | 1 | 5.6% |
Space Separator
| Value | Count | Frequency (%) |
| 9298 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 765 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 49413 | |
| Latin | 30937 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 9358 | |
| t | 5738 | |
| m | 5102 | |
| f | 4103 | |
| r | 1891 | 6.1% |
| s | 1851 | 6.0% |
| a | 854 | 2.8% |
| l | 695 | 2.2% |
| n | 346 | 1.1% |
| v | 331 | 1.1% |
| Other values (20) | 668 | 2.2% |
Common
| Value | Count | Frequency (%) |
| 0 | 16889 | |
| 9298 | ||
| 1 | 4088 | 8.3% |
| 5 | 3791 | 7.7% |
| 2 | 2912 | 5.9% |
| . | 2458 | 5.0% |
| 3 | 2121 | 4.3% |
| 4 | 1908 | 3.9% |
| 6 | 1282 | 2.6% |
| 7 | 1249 | 2.5% |
| Other values (14) | 3417 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 80350 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 16889 | |
| e | 9358 | |
| 9298 | ||
| t | 5738 | 7.1% |
| m | 5102 | 6.3% |
| f | 4103 | 5.1% |
| 1 | 4088 | 5.1% |
| 5 | 3791 | 4.7% |
| 2 | 2912 | 3.6% |
| . | 2458 | 3.1% |
| Other values (44) | 16613 |
verbatimDepth
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 16.7% |
| Missing | 604620 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 25 |
| Mean length | 25 |
| Min length | 25 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 220m inside cave entrance |
|---|---|
| 2nd row | 220m inside cave entrance |
| 3rd row | 220m inside cave entrance |
| 4th row | 220m inside cave entrance |
| 5th row | 220m inside cave entrance |
| Value | Count | Frequency (%) |
| 220m | 6 | |
| inside | 6 | |
| cave | 6 | |
| entrance | 6 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 24 | |
| 18 | ||
| n | 18 | |
| 2 | 12 | |
| i | 12 | |
| c | 12 | |
| a | 12 | |
| 0 | 6 | 4.0% |
| m | 6 | 4.0% |
| s | 6 | 4.0% |
| Other values (4) | 24 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 114 | |
| Space Separator | 18 | 12.0% |
| Decimal Number | 18 | 12.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 24 | |
| n | 18 | |
| i | 12 | |
| c | 12 | |
| a | 12 | |
| m | 6 | 5.3% |
| s | 6 | 5.3% |
| d | 6 | 5.3% |
| v | 6 | 5.3% |
| t | 6 | 5.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 12 | |
| 0 | 6 |
Space Separator
| Value | Count | Frequency (%) |
| 18 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 114 | |
| Common | 36 | 24.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 24 | |
| n | 18 | |
| i | 12 | |
| c | 12 | |
| a | 12 | |
| m | 6 | 5.3% |
| s | 6 | 5.3% |
| d | 6 | 5.3% |
| v | 6 | 5.3% |
| t | 6 | 5.3% |
Common
| Value | Count | Frequency (%) |
| 18 | ||
| 2 | 12 | |
| 0 | 6 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 150 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 24 | |
| 18 | ||
| n | 18 | |
| 2 | 12 | |
| i | 12 | |
| c | 12 | |
| a | 12 | |
| 0 | 6 | 4.0% |
| m | 6 | 4.0% |
| s | 6 | 4.0% |
| Other values (4) | 24 |
minimumDistanceAboveSurfaceInMeters
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 15.5 |
| Mean length | 15.5 |
| Min length | 12 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Poole, R. W. |
|---|---|
| 2nd row | Garrison, Rosser W. |
| Value | Count | Frequency (%) |
| w | 2 | |
| poole | 1 | |
| r | 1 | |
| garrison | 1 | |
| rosser | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 4 | |
| 4 | ||
| . | 3 | |
| r | 3 | |
| s | 3 | |
| e | 2 | 6.5% |
| , | 2 | 6.5% |
| R | 2 | 6.5% |
| W | 2 | 6.5% |
| P | 1 | 3.2% |
| Other values (5) | 5 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16 | |
| Uppercase Letter | 6 | 19.4% |
| Other Punctuation | 5 | 16.1% |
| Space Separator | 4 | 12.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 4 | |
| r | 3 | |
| s | 3 | |
| e | 2 | |
| l | 1 | 6.2% |
| a | 1 | 6.2% |
| i | 1 | 6.2% |
| n | 1 | 6.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 2 | |
| W | 2 | |
| P | 1 | |
| G | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 | |
| , | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 22 | |
| Common | 9 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 4 | |
| r | 3 | |
| s | 3 | |
| e | 2 | |
| R | 2 | |
| W | 2 | |
| P | 1 | 4.5% |
| l | 1 | 4.5% |
| G | 1 | 4.5% |
| a | 1 | 4.5% |
| Other values (2) | 2 |
Common
| Value | Count | Frequency (%) |
| 4 | ||
| . | 3 | |
| , | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 4 | |
| 4 | ||
| . | 3 | |
| r | 3 | |
| s | 3 | |
| e | 2 | 6.5% |
| , | 2 | 6.5% |
| R | 2 | 6.5% |
| W | 2 | 6.5% |
| P | 1 | 3.2% |
| Other values (5) | 5 |
decimalLatitude
Text
Missing 
| Distinct | 38003 |
|---|---|
| Distinct (%) | 11.9% |
| Missing | 285575 |
| Missing (%) | 47.2% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 6.690350446 |
| Min length | 3 |
Unique
| Unique | 15800 ? |
|---|---|
| Unique (%) | 5.0% |
Sample
| 1st row | 9.91378 |
|---|---|
| 2nd row | 57.18 |
| 3rd row | 36.5787 |
| 4th row | 15.5864 |
| 5th row | 45.4838 |
| Value | Count | Frequency (%) |
| 39.6891 | 5053 | 1.6% |
| 60.75 | 3839 | 1.2% |
| 60.7493 | 2462 | 0.8% |
| 40.0925 | 2379 | 0.7% |
| 38.02 | 2013 | 0.6% |
| 42.7299 | 1697 | 0.5% |
| 37.23 | 1343 | 0.4% |
| 40.015 | 1287 | 0.4% |
| 42.78 | 1170 | 0.4% |
| 38.9559 | 1141 | 0.4% |
| Other values (37323) | 296667 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 319051 | |
| 3 | 273937 | |
| 4 | 209092 | |
| 1 | 188994 | |
| 2 | 172377 | |
| 9 | 169623 | |
| 7 | 165639 | |
| 8 | 159036 | |
| 5 | 153205 | |
| 6 | 152373 | |
| Other values (3) | 171236 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1774325 | |
| Other Punctuation | 319051 | 14.9% |
| Dash Punctuation | 41186 | 1.9% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 273937 | |
| 4 | 209092 | |
| 1 | 188994 | |
| 2 | 172377 | |
| 9 | 169623 | |
| 7 | 165639 | |
| 8 | 159036 | |
| 5 | 153205 | |
| 6 | 152373 | |
| 0 | 130049 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 319051 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 41186 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2134562 | |
| Latin | 1 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 319051 | |
| 3 | 273937 | |
| 4 | 209092 | |
| 1 | 188994 | |
| 2 | 172377 | |
| 9 | 169623 | |
| 7 | 165639 | |
| 8 | 159036 | |
| 5 | 153205 | |
| 6 | 152373 | |
| Other values (2) | 171235 |
Latin
| Value | Count | Frequency (%) |
| E | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2134563 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 319051 | |
| 3 | 273937 | |
| 4 | 209092 | |
| 1 | 188994 | |
| 2 | 172377 | |
| 9 | 169623 | |
| 7 | 165639 | |
| 8 | 159036 | |
| 5 | 153205 | |
| 6 | 152373 | |
| Other values (3) | 171236 |
decimalLongitude
Text
Missing 
| Distinct | 36962 |
|---|---|
| Distinct (%) | 11.6% |
| Missing | 285575 |
| Missing (%) | 47.2% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 8 |
| Mean length | 7.477729266 |
| Min length | 3 |
Unique
| Unique | 15095 ? |
|---|---|
| Unique (%) | 4.7% |
Sample
| 1st row | -83.6744 |
|---|---|
| 2nd row | -170.27 |
| 3rd row | -75.8881 |
| 4th row | -61.4739 |
| 5th row | -75.9727 |
| Value | Count | Frequency (%) |
| 105.644 | 5103 | 1.6% |
| 139.5 | 3837 | 1.2% |
| 139.504 | 2462 | 0.8% |
| 105.358 | 2379 | 0.7% |
| 87.8123 | 1697 | 0.5% |
| 119.93 | 1404 | 0.4% |
| 105.27 | 1361 | 0.4% |
| 80.4178 | 1322 | 0.4% |
| 0.365 | 1301 | 0.4% |
| 87.76 | 1163 | 0.4% |
| Other values (36457) | 297022 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 319051 | |
| 1 | 292907 | |
| - | 270810 | |
| 7 | 217640 | |
| 8 | 193876 | |
| 6 | 165493 | |
| 5 | 162714 | |
| 3 | 158462 | |
| 2 | 156819 | |
| 9 | 154516 | |
| Other values (2) | 293489 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1795916 | |
| Other Punctuation | 319051 | 13.4% |
| Dash Punctuation | 270810 | 11.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 292907 | |
| 7 | 217640 | |
| 8 | 193876 | |
| 6 | 165493 | |
| 5 | 162714 | |
| 3 | 158462 | |
| 2 | 156819 | |
| 9 | 154516 | |
| 4 | 148470 | |
| 0 | 145019 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 319051 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 270810 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2385777 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 319051 | |
| 1 | 292907 | |
| - | 270810 | |
| 7 | 217640 | |
| 8 | 193876 | |
| 6 | 165493 | |
| 5 | 162714 | |
| 3 | 158462 | |
| 2 | 156819 | |
| 9 | 154516 | |
| Other values (2) | 293489 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2385777 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 319051 | |
| 1 | 292907 | |
| - | 270810 | |
| 7 | 217640 | |
| 8 | 193876 | |
| 6 | 165493 | |
| 5 | 162714 | |
| 3 | 158462 | |
| 2 | 156819 | |
| 9 | 154516 | |
| Other values (2) | 293489 |
coordinateUncertaintyInMeters
Text
Missing 
| Distinct | 1493 |
|---|---|
| Distinct (%) | 12.5% |
| Missing | 592674 |
| Missing (%) | 98.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 6 |
| Mean length | 6.138386881 |
| Min length | 4 |
Unique
| Unique | 745 ? |
|---|---|
| Unique (%) | 6.2% |
Sample
| 1st row | 931.0 |
|---|---|
| 2nd row | 10206.0 |
| 3rd row | 6642.0 |
| 4th row | 3036.0 |
| 5th row | 301.0 |
| Value | Count | Frequency (%) |
| 3036.0 | 1744 | 14.6% |
| 301.0 | 466 | 3.9% |
| 34239.0 | 426 | 3.6% |
| 1189.0 | 258 | 2.2% |
| 20000.0 | 247 | 2.1% |
| 3048.0 | 220 | 1.8% |
| 15000.0 | 199 | 1.7% |
| 52150.0 | 194 | 1.6% |
| 14563.0 | 162 | 1.4% |
| 9346.0 | 135 | 1.1% |
| Other values (1483) | 7901 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 21190 | |
| . | 11952 | |
| 3 | 8252 | 11.2% |
| 1 | 6352 | 8.7% |
| 2 | 4892 | 6.7% |
| 6 | 4647 | 6.3% |
| 4 | 3910 | 5.3% |
| 5 | 3500 | 4.8% |
| 9 | 3065 | 4.2% |
| 8 | 2861 | 3.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 61414 | |
| Other Punctuation | 11952 | 16.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 21190 | |
| 3 | 8252 | 13.4% |
| 1 | 6352 | 10.3% |
| 2 | 4892 | 8.0% |
| 6 | 4647 | 7.6% |
| 4 | 3910 | 6.4% |
| 5 | 3500 | 5.7% |
| 9 | 3065 | 5.0% |
| 8 | 2861 | 4.7% |
| 7 | 2745 | 4.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 11952 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 73366 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 21190 | |
| . | 11952 | |
| 3 | 8252 | 11.2% |
| 1 | 6352 | 8.7% |
| 2 | 4892 | 6.7% |
| 6 | 4647 | 6.3% |
| 4 | 3910 | 5.3% |
| 5 | 3500 | 4.8% |
| 9 | 3065 | 4.2% |
| 8 | 2861 | 3.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 73366 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 21190 | |
| . | 11952 | |
| 3 | 8252 | 11.2% |
| 1 | 6352 | 8.7% |
| 2 | 4892 | 6.7% |
| 6 | 4647 | 6.3% |
| 4 | 3910 | 5.3% |
| 5 | 3500 | 4.8% |
| 9 | 3065 | 4.2% |
| 8 | 2861 | 3.9% |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1937011 |
|---|---|
| 2nd row | 1424710 |
| Value | Count | Frequency (%) |
| 1937011 | 1 | |
| 1424710 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 7 | 2 | 14.3% |
| 0 | 2 | 14.3% |
| 4 | 2 | 14.3% |
| 9 | 1 | 7.1% |
| 3 | 1 | 7.1% |
| 2 | 1 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 7 | 2 | 14.3% |
| 0 | 2 | 14.3% |
| 4 | 2 | 14.3% |
| 9 | 1 | 7.1% |
| 3 | 1 | 7.1% |
| 2 | 1 | 7.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 7 | 2 | 14.3% |
| 0 | 2 | 14.3% |
| 4 | 2 | 14.3% |
| 9 | 1 | 7.1% |
| 3 | 1 | 7.1% |
| 2 | 1 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 7 | 2 | 14.3% |
| 0 | 2 | 14.3% |
| 4 | 2 | 14.3% |
| 9 | 1 | 7.1% |
| 3 | 1 | 7.1% |
| 2 | 1 | 7.1% |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604625 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 23 |
| Min length | 23 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Degrees Minutes Seconds |
|---|
| Value | Count | Frequency (%) |
| degrees | 1 | |
| minutes | 1 | |
| seconds | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 5 | |
| s | 3 | |
| 2 | 8.7% | |
| n | 2 | 8.7% |
| D | 1 | 4.3% |
| g | 1 | 4.3% |
| r | 1 | 4.3% |
| M | 1 | 4.3% |
| i | 1 | 4.3% |
| u | 1 | 4.3% |
| Other values (5) | 5 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18 | |
| Uppercase Letter | 3 | 13.0% |
| Space Separator | 2 | 8.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 5 | |
| s | 3 | |
| n | 2 | 11.1% |
| g | 1 | 5.6% |
| r | 1 | 5.6% |
| i | 1 | 5.6% |
| u | 1 | 5.6% |
| t | 1 | 5.6% |
| c | 1 | 5.6% |
| o | 1 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 1 | |
| M | 1 | |
| S | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21 | |
| Common | 2 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 5 | |
| s | 3 | |
| n | 2 | 9.5% |
| D | 1 | 4.8% |
| g | 1 | 4.8% |
| r | 1 | 4.8% |
| M | 1 | 4.8% |
| i | 1 | 4.8% |
| u | 1 | 4.8% |
| t | 1 | 4.8% |
| Other values (4) | 4 |
Common
| Value | Count | Frequency (%) |
| 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 5 | |
| s | 3 | |
| 2 | 8.7% | |
| n | 2 | 8.7% |
| D | 1 | 4.3% |
| g | 1 | 4.3% |
| r | 1 | 4.3% |
| M | 1 | 4.3% |
| i | 1 | 4.3% |
| u | 1 | 4.3% |
| Other values (5) | 5 |
verbatimSRS
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604625 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1973-05-08 |
|---|
| Value | Count | Frequency (%) |
| 1973-05-08 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 2 | |
| 0 | 2 | |
| 1 | 1 | |
| 9 | 1 | |
| 7 | 1 | |
| 3 | 1 | |
| 5 | 1 | |
| 8 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8 | |
| Dash Punctuation | 2 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 1 | 1 | |
| 9 | 1 | |
| 7 | 1 | |
| 3 | 1 | |
| 5 | 1 | |
| 8 | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 2 | |
| 0 | 2 | |
| 1 | 1 | |
| 9 | 1 | |
| 7 | 1 | |
| 3 | 1 | |
| 5 | 1 | |
| 8 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 2 | |
| 0 | 2 | |
| 1 | 1 | |
| 9 | 1 | |
| 7 | 1 | |
| 3 | 1 | |
| 5 | 1 | |
| 8 | 1 |
footprintSRS
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604625 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 128 |
|---|
| Value | Count | Frequency (%) |
| 128 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 8 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 8 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 8 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 8 | 1 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604625 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 128 |
|---|
| Value | Count | Frequency (%) |
| 128 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 8 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 8 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 8 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 8 | 1 |
georeferencedBy
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604623 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 32 |
| Mean length | 23.66666667 |
| Min length | 4 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Troides amphrysus (Cramer, 1779) |
|---|---|
| 2nd row | Gynacantha membranalis Karsch, 1891 |
| 3rd row | 1973 |
| Value | Count | Frequency (%) |
| troides | 1 | |
| amphrysus | 1 | |
| cramer | 1 | |
| 1779 | 1 | |
| gynacantha | 1 | |
| membranalis | 1 | |
| karsch | 1 | |
| 1891 | 1 | |
| 1973 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8 | 11.3% |
| r | 6 | 8.5% |
| 6 | 8.5% | |
| s | 5 | 7.0% |
| m | 4 | 5.6% |
| 1 | 4 | 5.6% |
| 9 | 3 | 4.2% |
| 7 | 3 | 4.2% |
| e | 3 | 4.2% |
| n | 3 | 4.2% |
| Other values (20) | 26 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 45 | |
| Decimal Number | 12 | 16.9% |
| Space Separator | 6 | 8.5% |
| Uppercase Letter | 4 | 5.6% |
| Other Punctuation | 2 | 2.8% |
| Close Punctuation | 1 | 1.4% |
| Open Punctuation | 1 | 1.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8 | |
| r | 6 | |
| s | 5 | |
| m | 4 | |
| e | 3 | 6.7% |
| n | 3 | 6.7% |
| h | 3 | 6.7% |
| c | 2 | 4.4% |
| i | 2 | 4.4% |
| y | 2 | 4.4% |
| Other values (7) | 7 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4 | |
| 9 | 3 | |
| 7 | 3 | |
| 8 | 1 | 8.3% |
| 3 | 1 | 8.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1 | |
| K | 1 | |
| C | 1 | |
| G | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 6 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 49 | |
| Common | 22 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8 | |
| r | 6 | |
| s | 5 | |
| m | 4 | 8.2% |
| e | 3 | 6.1% |
| n | 3 | 6.1% |
| h | 3 | 6.1% |
| c | 2 | 4.1% |
| i | 2 | 4.1% |
| y | 2 | 4.1% |
| Other values (11) | 11 |
Common
| Value | Count | Frequency (%) |
| 6 | ||
| 1 | 4 | |
| 9 | 3 | |
| 7 | 3 | |
| , | 2 | 9.1% |
| 8 | 1 | 4.5% |
| ) | 1 | 4.5% |
| ( | 1 | 4.5% |
| 3 | 1 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 71 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8 | 11.3% |
| r | 6 | 8.5% |
| 6 | 8.5% | |
| s | 5 | 7.0% |
| m | 4 | 5.6% |
| 1 | 4 | 5.6% |
| 9 | 3 | 4.2% |
| 7 | 3 | 4.2% |
| e | 3 | 4.2% |
| n | 3 | 4.2% |
| Other values (20) | 26 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604625 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 5 |
|---|
| Value | Count | Frequency (%) |
| 5 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 1 |
Missing 
| Distinct | 65 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 366755 |
| Missing (%) | 60.7% |
| Memory size | 4.6 MiB |
Length
| Max length | 72 |
|---|---|
| Median length | 12 |
| Mean length | 10.94743369 |
| Min length | 1 |
Unique
| Unique | 22 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Google Maps |
|---|---|
| 2nd row | Google Earth |
| 3rd row | Google Earth |
| 4th row | GEOLocate |
| 5th row | Google Earth |
| Value | Count | Frequency (%) |
| 163378 | ||
| earth | 120763 | |
| geolocate | 70753 | |
| maps | 42641 | 10.5% |
| gps | 1516 | 0.4% |
| coordinates | 782 | 0.2% |
| centroid | 781 | 0.2% |
| geonames | 718 | 0.2% |
| from | 711 | 0.2% |
| country | 671 | 0.2% |
| Other values (106) | 2062 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 402567 | |
| e | 238609 | |
| a | 237477 | |
| G | 236541 | |
| t | 194803 | |
| E | 191420 | |
| l | 169480 | 6.5% |
| 166905 | 6.4% | |
| g | 163810 | 6.3% |
| r | 124366 | 4.8% |
| Other values (51) | 478099 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1823782 | |
| Uppercase Letter | 612188 | 23.5% |
| Space Separator | 166905 | 6.4% |
| Decimal Number | 942 | < 0.1% |
| Other Punctuation | 250 | < 0.1% |
| Dash Punctuation | 10 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 402567 | |
| e | 238609 | |
| a | 237477 | |
| t | 194803 | |
| l | 169480 | |
| g | 163810 | |
| r | 124366 | 6.8% |
| h | 120864 | 6.6% |
| c | 72653 | 4.0% |
| s | 44346 | 2.4% |
| Other values (14) | 54807 | 3.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 236541 | |
| E | 191420 | |
| O | 70680 | 11.5% |
| L | 65526 | 10.7% |
| M | 42654 | 7.0% |
| S | 1607 | 0.3% |
| P | 1564 | 0.3% |
| C | 982 | 0.2% |
| N | 744 | 0.1% |
| B | 158 | < 0.1% |
| Other values (8) | 312 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 213 | |
| 1 | 200 | |
| 7 | 175 | |
| 2 | 170 | |
| 0 | 94 | |
| 6 | 48 | 5.1% |
| 8 | 17 | 1.8% |
| 4 | 14 | 1.5% |
| 3 | 9 | 1.0% |
| 5 | 2 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 85 | |
| & | 49 | |
| / | 48 | |
| . | 43 | |
| : | 21 | 8.4% |
| " | 2 | 0.8% |
| ; | 2 | 0.8% |
Space Separator
| Value | Count | Frequency (%) |
| 166905 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 10 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2435970 | |
| Common | 168107 | 6.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 402567 | |
| e | 238609 | |
| a | 237477 | |
| G | 236541 | |
| t | 194803 | |
| E | 191420 | |
| l | 169480 | |
| g | 163810 | |
| r | 124366 | 5.1% |
| h | 120864 | 5.0% |
| Other values (32) | 356033 |
Common
| Value | Count | Frequency (%) |
| 166905 | ||
| 9 | 213 | 0.1% |
| 1 | 200 | 0.1% |
| 7 | 175 | 0.1% |
| 2 | 170 | 0.1% |
| 0 | 94 | 0.1% |
| , | 85 | 0.1% |
| & | 49 | < 0.1% |
| / | 48 | < 0.1% |
| 6 | 48 | < 0.1% |
| Other values (9) | 120 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2604077 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 402567 | |
| e | 238609 | |
| a | 237477 | |
| G | 236541 | |
| t | 194803 | |
| E | 191420 | |
| l | 169480 | 6.5% |
| 166905 | 6.4% | |
| g | 163810 | 6.3% |
| r | 124366 | 4.8% |
| Other values (51) | 478099 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7.5 |
| Mean length | 7.5 |
| Min length | 7 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 9 March |
|---|---|
| 2nd row | 8.v.1973 |
| Value | Count | Frequency (%) |
| 9 | 1 | |
| march | 1 | |
| 8.v.1973 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 2 | |
| . | 2 | |
| 1 | 6.7% | |
| M | 1 | 6.7% |
| a | 1 | 6.7% |
| r | 1 | 6.7% |
| c | 1 | 6.7% |
| h | 1 | 6.7% |
| 8 | 1 | 6.7% |
| v | 1 | 6.7% |
| Other values (3) | 3 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 | |
| Lowercase Letter | 5 | |
| Other Punctuation | 2 | 13.3% |
| Space Separator | 1 | 6.7% |
| Uppercase Letter | 1 | 6.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 2 | |
| 8 | 1 | |
| 1 | 1 | |
| 7 | 1 | |
| 3 | 1 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1 | |
| r | 1 | |
| c | 1 | |
| h | 1 | |
| v | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9 | |
| Latin | 6 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 9 | 2 | |
| . | 2 | |
| 1 | ||
| 8 | 1 | |
| 1 | 1 | |
| 7 | 1 | |
| 3 | 1 |
Latin
| Value | Count | Frequency (%) |
| M | 1 | |
| a | 1 | |
| r | 1 | |
| c | 1 | |
| h | 1 | |
| v | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | 2 | |
| . | 2 | |
| 1 | 6.7% | |
| M | 1 | 6.7% |
| a | 1 | 6.7% |
| r | 1 | 6.7% |
| c | 1 | 6.7% |
| h | 1 | 6.7% |
| 8 | 1 | 6.7% |
| v | 1 | 6.7% |
| Other values (3) | 3 |
Missing 
| Distinct | 1134 |
|---|---|
| Distinct (%) | 13.4% |
| Missing | 596178 |
| Missing (%) | 98.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 200 |
|---|---|
| Median length | 182 |
| Mean length | 45.17341383 |
| Min length | 10 |
Unique
| Unique | 400 ? |
|---|---|
| Unique (%) | 4.7% |
Sample
| 1st row | Coordinate Uncertainty In Meters: 56182 |
|---|---|
| 2nd row | Coordinate Uncertainty In Meters: 49611 |
| 3rd row | Coordinate Uncertainty In Meters: 97700 |
| 4th row | Coordinate Uncertainty In Meters: 41787 |
| 5th row | Coordinate Uncertainty In Meters: 71236 |
| Value | Count | Frequency (%) |
| in | 8278 | |
| coordinate | 8139 | |
| meters | 8139 | |
| uncertainty | 8139 | |
| verbatim | 1307 | 2.7% |
| coordinate-degrees | 1307 | 2.7% |
| minutes | 1307 | 2.7% |
| 3792 | 274 | 0.6% |
| the | 221 | 0.5% |
| 6066 | 174 | 0.4% |
| Other values (1273) | 10423 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 42267 | 11.1% |
| 39260 | 10.3% | |
| t | 37512 | 9.8% |
| n | 36163 | 9.5% |
| r | 29378 | 7.7% |
| i | 21344 | 5.6% |
| o | 20135 | 5.3% |
| a | 19989 | 5.2% |
| s | 11758 | 3.1% |
| d | 9749 | 2.6% |
| Other values (59) | 114070 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 254955 | |
| Space Separator | 39260 | 10.3% |
| Decimal Number | 38767 | 10.2% |
| Uppercase Letter | 37565 | 9.8% |
| Other Punctuation | 9665 | 2.5% |
| Dash Punctuation | 1342 | 0.4% |
| Open Punctuation | 33 | < 0.1% |
| Close Punctuation | 33 | < 0.1% |
| Initial Punctuation | 2 | < 0.1% |
| Final Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 42267 | |
| t | 37512 | |
| n | 36163 | |
| r | 29378 | |
| i | 21344 | |
| o | 20135 | |
| a | 19989 | |
| s | 11758 | 4.6% |
| d | 9749 | 3.8% |
| c | 8645 | 3.4% |
| Other values (16) | 18015 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 9645 | |
| M | 8186 | |
| U | 8173 | |
| I | 8160 | |
| D | 1329 | 3.5% |
| V | 1307 | 3.5% |
| T | 264 | 0.7% |
| N | 88 | 0.2% |
| S | 85 | 0.2% |
| G | 82 | 0.2% |
| Other values (10) | 246 | 0.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4554 | |
| 6 | 4450 | |
| 0 | 4424 | |
| 3 | 4271 | |
| 2 | 4116 | |
| 5 | 3996 | |
| 4 | 3411 | |
| 7 | 3300 | |
| 9 | 3146 | |
| 8 | 3099 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 8139 | |
| ; | 1326 | 13.7% |
| , | 101 | 1.0% |
| . | 90 | 0.9% |
| ' | 5 | 0.1% |
| " | 4 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 39260 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1342 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 33 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 33 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 2 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 2 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 292520 | |
| Common | 89105 | 23.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 42267 | |
| t | 37512 | |
| n | 36163 | |
| r | 29378 | |
| i | 21344 | 7.3% |
| o | 20135 | 6.9% |
| a | 19989 | 6.8% |
| s | 11758 | 4.0% |
| d | 9749 | 3.3% |
| C | 9645 | 3.3% |
| Other values (36) | 54580 |
Common
| Value | Count | Frequency (%) |
| 39260 | ||
| : | 8139 | 9.1% |
| 1 | 4554 | 5.1% |
| 6 | 4450 | 5.0% |
| 0 | 4424 | 5.0% |
| 3 | 4271 | 4.8% |
| 2 | 4116 | 4.6% |
| 5 | 3996 | 4.5% |
| 4 | 3411 | 3.8% |
| 7 | 3300 | 3.7% |
| Other values (13) | 9184 | 10.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 381616 | |
| None | 5 | < 0.1% |
| Punctuation | 4 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 42267 | 11.1% |
| 39260 | 10.3% | |
| t | 37512 | 9.8% |
| n | 36163 | 9.5% |
| r | 29378 | 7.7% |
| i | 21344 | 5.6% |
| o | 20135 | 5.3% |
| a | 19989 | 5.2% |
| s | 11758 | 3.1% |
| d | 9749 | 2.6% |
| Other values (56) | 114061 |
None
| Value | Count | Frequency (%) |
| ñ | 5 |
Punctuation
| Value | Count | Frequency (%) |
| “ | 2 | |
| ” | 2 |
latestEonOrHighestEonothem
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 70 |
|---|---|
| Median length | 65.5 |
| Mean length | 65.5 |
| Min length | 61 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Animalia, Arthropoda, Insecta, Lepidoptera, Papilionidae, Papilioninae |
|---|---|
| 2nd row | Animalia, Arthropoda, Insecta, Odonata, Anisoptera, Aeshnidae |
| Value | Count | Frequency (%) |
| animalia | 2 | |
| arthropoda | 2 | |
| insecta | 2 | |
| lepidoptera | 1 | |
| papilionidae | 1 | |
| papilioninae | 1 | |
| odonata | 1 | |
| anisoptera | 1 | |
| aeshnidae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 17 | |
| i | 13 | 9.9% |
| , | 10 | 7.6% |
| 10 | 7.6% | |
| n | 10 | 7.6% |
| e | 9 | 6.9% |
| o | 9 | 6.9% |
| t | 7 | 5.3% |
| p | 7 | 5.3% |
| A | 6 | 4.6% |
| Other values (11) | 33 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 99 | |
| Uppercase Letter | 12 | 9.2% |
| Other Punctuation | 10 | 7.6% |
| Space Separator | 10 | 7.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 17 | |
| i | 13 | |
| n | 10 | |
| e | 9 | |
| o | 9 | |
| t | 7 | |
| p | 7 | |
| r | 6 | 6.1% |
| d | 6 | 6.1% |
| s | 4 | 4.0% |
| Other values (4) | 11 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 6 | |
| I | 2 | 16.7% |
| P | 2 | 16.7% |
| L | 1 | 8.3% |
| O | 1 | 8.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 10 |
Space Separator
| Value | Count | Frequency (%) |
| 10 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 111 | |
| Common | 20 | 15.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 17 | |
| i | 13 | |
| n | 10 | |
| e | 9 | |
| o | 9 | |
| t | 7 | 6.3% |
| p | 7 | 6.3% |
| A | 6 | 5.4% |
| r | 6 | 5.4% |
| d | 6 | 5.4% |
| Other values (9) | 21 |
Common
| Value | Count | Frequency (%) |
| , | 10 | |
| 10 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 131 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 17 | |
| i | 13 | 9.9% |
| , | 10 | 7.6% |
| 10 | 7.6% | |
| n | 10 | 7.6% |
| e | 9 | 6.9% |
| o | 9 | 6.9% |
| t | 7 | 5.3% |
| p | 7 | 5.3% |
| A | 6 | 4.6% |
| Other values (11) | 33 |
earliestEraOrLowestErathem
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Animalia |
|---|---|
| 2nd row | Animalia |
| Value | Count | Frequency (%) |
| animalia | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 4 | |
| a | 4 | |
| A | 2 | |
| n | 2 | |
| m | 2 | |
| l | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 14 | |
| Uppercase Letter | 2 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 4 | |
| a | 4 | |
| n | 2 | |
| m | 2 | |
| l | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 4 | |
| a | 4 | |
| A | 2 | |
| n | 2 | |
| m | 2 | |
| l | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 4 | |
| a | 4 | |
| A | 2 | |
| n | 2 | |
| m | 2 | |
| l | 2 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Arthropoda |
|---|---|
| 2nd row | Arthropoda |
| Value | Count | Frequency (%) |
| arthropoda | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 4 | |
| o | 4 | |
| A | 2 | |
| t | 2 | |
| h | 2 | |
| p | 2 | |
| d | 2 | |
| a | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18 | |
| Uppercase Letter | 2 | 10.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 4 | |
| o | 4 | |
| t | 2 | |
| h | 2 | |
| p | 2 | |
| d | 2 | |
| a | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 4 | |
| o | 4 | |
| A | 2 | |
| t | 2 | |
| h | 2 | |
| p | 2 | |
| d | 2 | |
| a | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 4 | |
| o | 4 | |
| A | 2 | |
| t | 2 | |
| h | 2 | |
| p | 2 | |
| d | 2 | |
| a | 2 |
earliestPeriodOrLowestSystem
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Insecta |
|---|---|
| 2nd row | Insecta |
| Value | Count | Frequency (%) |
| insecta | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 2 | |
| n | 2 | |
| s | 2 | |
| e | 2 | |
| c | 2 | |
| t | 2 | |
| a | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12 | |
| Uppercase Letter | 2 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 2 | |
| s | 2 | |
| e | 2 | |
| c | 2 | |
| t | 2 | |
| a | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 2 | |
| n | 2 | |
| s | 2 | |
| e | 2 | |
| c | 2 | |
| t | 2 | |
| a | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 2 | |
| n | 2 | |
| s | 2 | |
| e | 2 | |
| c | 2 | |
| t | 2 | |
| a | 2 |
latestPeriodOrHighestSystem
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 7 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Lepidoptera |
|---|---|
| 2nd row | Odonata |
| Value | Count | Frequency (%) |
| lepidoptera | 1 | |
| odonata | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3 | |
| e | 2 | |
| p | 2 | |
| d | 2 | |
| o | 2 | |
| t | 2 | |
| L | 1 | 5.6% |
| i | 1 | 5.6% |
| r | 1 | 5.6% |
| O | 1 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16 | |
| Uppercase Letter | 2 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3 | |
| e | 2 | |
| p | 2 | |
| d | 2 | |
| o | 2 | |
| t | 2 | |
| i | 1 | 6.2% |
| r | 1 | 6.2% |
| n | 1 | 6.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 1 | |
| O | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3 | |
| e | 2 | |
| p | 2 | |
| d | 2 | |
| o | 2 | |
| t | 2 | |
| L | 1 | 5.6% |
| i | 1 | 5.6% |
| r | 1 | 5.6% |
| O | 1 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3 | |
| e | 2 | |
| p | 2 | |
| d | 2 | |
| o | 2 | |
| t | 2 | |
| L | 1 | 5.6% |
| i | 1 | 5.6% |
| r | 1 | 5.6% |
| O | 1 | 5.6% |
latestEpochOrHighestSeries
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604622 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 10.5 |
| Mean length | 14.25 |
| Min length | 4 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Papilionidae |
|---|---|
| 2nd row | United States, Florida, Pinellas |
| 3rd row | Aeshnidae |
| 4th row | Peru |
| Value | Count | Frequency (%) |
| papilionidae | 1 | |
| united | 1 | |
| states | 1 | |
| florida | 1 | |
| pinellas | 1 | |
| aeshnidae | 1 | |
| peru | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 7 | |
| e | 7 | |
| a | 6 | |
| l | 4 | 7.0% |
| n | 4 | 7.0% |
| d | 4 | 7.0% |
| P | 3 | 5.3% |
| s | 3 | 5.3% |
| 3 | 5.3% | |
| t | 3 | 5.3% |
| Other values (10) | 13 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 45 | |
| Uppercase Letter | 7 | 12.3% |
| Space Separator | 3 | 5.3% |
| Other Punctuation | 2 | 3.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 7 | |
| e | 7 | |
| a | 6 | |
| l | 4 | |
| n | 4 | |
| d | 4 | |
| s | 3 | |
| t | 3 | |
| o | 2 | 4.4% |
| r | 2 | 4.4% |
| Other values (3) | 3 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 3 | |
| U | 1 | 14.3% |
| S | 1 | 14.3% |
| F | 1 | 14.3% |
| A | 1 | 14.3% |
Space Separator
| Value | Count | Frequency (%) |
| 3 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 52 | |
| Common | 5 | 8.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 7 | |
| e | 7 | |
| a | 6 | |
| l | 4 | |
| n | 4 | |
| d | 4 | |
| P | 3 | 5.8% |
| s | 3 | 5.8% |
| t | 3 | 5.8% |
| o | 2 | 3.8% |
| Other values (8) | 9 |
Common
| Value | Count | Frequency (%) |
| 3 | ||
| , | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 57 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 7 | |
| e | 7 | |
| a | 6 | |
| l | 4 | 7.0% |
| n | 4 | 7.0% |
| d | 4 | 7.0% |
| P | 3 | 5.3% |
| s | 3 | 5.3% |
| 3 | 5.3% | |
| t | 3 | 5.3% |
| Other values (10) | 13 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | SOUTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 1 | |
| south_america | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 4 | |
| R | 3 | |
| O | 2 | |
| T | 2 | |
| H | 2 | |
| _ | 2 | |
| M | 2 | |
| E | 2 | |
| I | 2 | |
| C | 2 | |
| Other values (3) | 3 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 24 | |
| Connector Punctuation | 2 | 7.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 4 | |
| R | 3 | |
| O | 2 | |
| T | 2 | |
| H | 2 | |
| M | 2 | |
| E | 2 | |
| I | 2 | |
| C | 2 | |
| N | 1 | 4.2% |
| Other values (2) | 2 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 24 | |
| Common | 2 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 4 | |
| R | 3 | |
| O | 2 | |
| T | 2 | |
| H | 2 | |
| M | 2 | |
| E | 2 | |
| I | 2 | |
| C | 2 | |
| N | 1 | 4.2% |
| Other values (2) | 2 |
Common
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 4 | |
| R | 3 | |
| O | 2 | |
| T | 2 | |
| H | 2 | |
| _ | 2 | |
| M | 2 | |
| E | 2 | |
| I | 2 | |
| C | 2 | |
| Other values (3) | 3 |
highestBiostratigraphicZone
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 8.5 |
| Mean length | 8.5 |
| Min length | 7 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Troides |
|---|---|
| 2nd row | Gynacantha |
| Value | Count | Frequency (%) |
| troides | 1 | |
| gynacantha | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3 | |
| n | 2 | |
| T | 1 | 5.9% |
| r | 1 | 5.9% |
| o | 1 | 5.9% |
| i | 1 | 5.9% |
| d | 1 | 5.9% |
| e | 1 | 5.9% |
| s | 1 | 5.9% |
| G | 1 | 5.9% |
| Other values (4) | 4 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15 | |
| Uppercase Letter | 2 | 11.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3 | |
| n | 2 | |
| r | 1 | 6.7% |
| o | 1 | 6.7% |
| i | 1 | 6.7% |
| d | 1 | 6.7% |
| e | 1 | 6.7% |
| s | 1 | 6.7% |
| y | 1 | 6.7% |
| c | 1 | 6.7% |
| Other values (2) | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1 | |
| G | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3 | |
| n | 2 | |
| T | 1 | 5.9% |
| r | 1 | 5.9% |
| o | 1 | 5.9% |
| i | 1 | 5.9% |
| d | 1 | 5.9% |
| e | 1 | 5.9% |
| s | 1 | 5.9% |
| G | 1 | 5.9% |
| Other values (4) | 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3 | |
| n | 2 | |
| T | 1 | 5.9% |
| r | 1 | 5.9% |
| o | 1 | 5.9% |
| i | 1 | 5.9% |
| d | 1 | 5.9% |
| e | 1 | 5.9% |
| s | 1 | 5.9% |
| G | 1 | 5.9% |
| Other values (4) | 4 |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604622 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 8.5 |
| Mean length | 5.25 |
| Min length | 2 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Troides |
|---|---|
| 2nd row | US |
| 3rd row | Gynacantha |
| 4th row | PE |
| Value | Count | Frequency (%) |
| troides | 1 | |
| us | 1 | |
| gynacantha | 1 | |
| pe | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3 | 14.3% |
| n | 2 | 9.5% |
| T | 1 | 4.8% |
| r | 1 | 4.8% |
| P | 1 | 4.8% |
| h | 1 | 4.8% |
| t | 1 | 4.8% |
| c | 1 | 4.8% |
| y | 1 | 4.8% |
| G | 1 | 4.8% |
| Other values (8) | 8 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15 | |
| Uppercase Letter | 6 | 28.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3 | |
| n | 2 | |
| r | 1 | 6.7% |
| h | 1 | 6.7% |
| t | 1 | 6.7% |
| c | 1 | 6.7% |
| y | 1 | 6.7% |
| s | 1 | 6.7% |
| e | 1 | 6.7% |
| d | 1 | 6.7% |
| Other values (2) | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1 | |
| P | 1 | |
| G | 1 | |
| S | 1 | |
| U | 1 | |
| E | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3 | 14.3% |
| n | 2 | 9.5% |
| T | 1 | 4.8% |
| r | 1 | 4.8% |
| P | 1 | 4.8% |
| h | 1 | 4.8% |
| t | 1 | 4.8% |
| c | 1 | 4.8% |
| y | 1 | 4.8% |
| G | 1 | 4.8% |
| Other values (8) | 8 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3 | 14.3% |
| n | 2 | 9.5% |
| T | 1 | 4.8% |
| r | 1 | 4.8% |
| P | 1 | 4.8% |
| h | 1 | 4.8% |
| t | 1 | 4.8% |
| c | 1 | 4.8% |
| y | 1 | 4.8% |
| G | 1 | 4.8% |
| Other values (8) | 8 |
group
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604625 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Florida |
|---|
| Value | Count | Frequency (%) |
| florida | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 1 | |
| l | 1 | |
| o | 1 | |
| r | 1 | |
| i | 1 | |
| d | 1 | |
| a | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6 | |
| Uppercase Letter | 1 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 1 | |
| o | 1 | |
| r | 1 | |
| i | 1 | |
| d | 1 | |
| a | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| F | 1 | |
| l | 1 | |
| o | 1 | |
| r | 1 | |
| i | 1 | |
| d | 1 | |
| a | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| F | 1 | |
| l | 1 | |
| o | 1 | |
| r | 1 | |
| i | 1 | |
| d | 1 | |
| a | 1 |
formation
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604625 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Pinellas |
|---|
| Value | Count | Frequency (%) |
| pinellas | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 2 | |
| P | 1 | |
| i | 1 | |
| n | 1 | |
| e | 1 | |
| a | 1 | |
| s | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7 | |
| Uppercase Letter | 1 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 2 | |
| i | 1 | |
| n | 1 | |
| e | 1 | |
| a | 1 | |
| s | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 2 | |
| P | 1 | |
| i | 1 | |
| n | 1 | |
| e | 1 | |
| a | 1 | |
| s | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 2 | |
| P | 1 | |
| i | 1 | |
| n | 1 | |
| e | 1 | |
| a | 1 | |
| s | 1 |
member
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 9 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | amphrysus |
|---|---|
| 2nd row | membranalis |
| Value | Count | Frequency (%) |
| amphrysus | 1 | |
| membranalis | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3 | |
| m | 3 | |
| s | 3 | |
| r | 2 | |
| p | 1 | 5.0% |
| h | 1 | 5.0% |
| y | 1 | 5.0% |
| u | 1 | 5.0% |
| e | 1 | 5.0% |
| b | 1 | 5.0% |
| Other values (3) | 3 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3 | |
| m | 3 | |
| s | 3 | |
| r | 2 | |
| p | 1 | 5.0% |
| h | 1 | 5.0% |
| y | 1 | 5.0% |
| u | 1 | 5.0% |
| e | 1 | 5.0% |
| b | 1 | 5.0% |
| Other values (3) | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3 | |
| m | 3 | |
| s | 3 | |
| r | 2 | |
| p | 1 | 5.0% |
| h | 1 | 5.0% |
| y | 1 | 5.0% |
| u | 1 | 5.0% |
| e | 1 | 5.0% |
| b | 1 | 5.0% |
| Other values (3) | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3 | |
| m | 3 | |
| s | 3 | |
| r | 2 | |
| p | 1 | 5.0% |
| h | 1 | 5.0% |
| y | 1 | 5.0% |
| u | 1 | 5.0% |
| e | 1 | 5.0% |
| b | 1 | 5.0% |
| Other values (3) | 3 |
bed
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 31 |
|---|---|
| Median length | 22.5 |
| Mean length | 22.5 |
| Min length | 14 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | St. Petersburg |
|---|---|
| 2nd row | Huaru Valley, 90 mi. N. of Lima |
| Value | Count | Frequency (%) |
| st | 1 | |
| petersburg | 1 | |
| huaru | 1 | |
| valley | 1 | |
| 90 | 1 | |
| mi | 1 | |
| n | 1 | |
| of | 1 | |
| lima | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | ||
| a | 3 | 6.7% |
| . | 3 | 6.7% |
| e | 3 | 6.7% |
| r | 3 | 6.7% |
| u | 3 | 6.7% |
| i | 2 | 4.4% |
| m | 2 | 4.4% |
| t | 2 | 4.4% |
| l | 2 | 4.4% |
| Other values (15) | 15 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 26 | |
| Space Separator | 7 | 15.6% |
| Uppercase Letter | 6 | 13.3% |
| Other Punctuation | 4 | 8.9% |
| Decimal Number | 2 | 4.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3 | |
| e | 3 | |
| r | 3 | |
| u | 3 | |
| i | 2 | |
| m | 2 | |
| t | 2 | |
| l | 2 | |
| f | 1 | 3.8% |
| o | 1 | 3.8% |
| Other values (4) | 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1 | |
| S | 1 | |
| V | 1 | |
| H | 1 | |
| P | 1 | |
| L | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 | |
| , | 1 | 25.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 9 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 32 | |
| Common | 13 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3 | 9.4% |
| e | 3 | 9.4% |
| r | 3 | 9.4% |
| u | 3 | 9.4% |
| i | 2 | 6.2% |
| m | 2 | 6.2% |
| t | 2 | 6.2% |
| l | 2 | 6.2% |
| f | 1 | 3.1% |
| o | 1 | 3.1% |
| Other values (10) | 10 |
Common
| Value | Count | Frequency (%) |
| 7 | ||
| . | 3 | |
| , | 1 | 7.7% |
| 0 | 1 | 7.7% |
| 9 | 1 | 7.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 45 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | ||
| a | 3 | 6.7% |
| . | 3 | 6.7% |
| e | 3 | 6.7% |
| r | 3 | 6.7% |
| u | 3 | 6.7% |
| i | 2 | 4.4% |
| m | 2 | 4.4% |
| t | 2 | 4.4% |
| l | 2 | 4.4% |
| Other values (15) | 15 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SPECIES |
|---|---|
| 2nd row | SPECIES |
| Value | Count | Frequency (%) |
| species | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 4 | |
| E | 4 | |
| P | 2 | |
| C | 2 | |
| I | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 14 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 4 | |
| E | 4 | |
| P | 2 | |
| C | 2 | |
| I | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 4 | |
| E | 4 | |
| P | 2 | |
| C | 2 | |
| I | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 4 | |
| E | 4 | |
| P | 2 | |
| C | 2 | |
| I | 2 |
Missing 
| Distinct | 15 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 603189 |
| Missing (%) | 99.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 5.812108559 |
| Min length | 2 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | near |
|---|---|
| 2nd row | uncertain |
| 3rd row | near |
| 4th row | near |
| 5th row | cf. |
| Value | Count | Frequency (%) |
| near | 466 | |
| uncertain | 459 | |
| cf | 238 | |
| group | 113 | 7.7% |
| subgroup | 80 | 5.4% |
| complex | 26 | 1.8% |
| aff | 21 | 1.4% |
| sp | 21 | 1.4% |
| n | 15 | 1.0% |
| sensu | 11 | 0.7% |
| Other values (5) | 23 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 1418 | |
| r | 1131 | |
| e | 962 | |
| a | 947 | |
| u | 743 | |
| c | 732 | |
| t | 481 | 5.8% |
| i | 470 | 5.6% |
| f | 280 | 3.4% |
| p | 240 | 2.9% |
| Other values (12) | 948 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8132 | |
| Other Punctuation | 180 | 2.2% |
| Space Separator | 36 | 0.4% |
| Uppercase Letter | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 1418 | |
| r | 1131 | |
| e | 962 | |
| a | 947 | |
| u | 743 | |
| c | 732 | |
| t | 481 | 5.9% |
| i | 470 | 5.8% |
| f | 280 | 3.4% |
| p | 240 | 3.0% |
| Other values (8) | 728 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2 | |
| B | 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 180 |
Space Separator
| Value | Count | Frequency (%) |
| 36 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8136 | |
| Common | 216 | 2.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 1418 | |
| r | 1131 | |
| e | 962 | |
| a | 947 | |
| u | 743 | |
| c | 732 | |
| t | 481 | 5.9% |
| i | 470 | 5.8% |
| f | 280 | 3.4% |
| p | 240 | 2.9% |
| Other values (10) | 732 |
Common
| Value | Count | Frequency (%) |
| . | 180 | |
| 36 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8352 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 1418 | |
| r | 1131 | |
| e | 962 | |
| a | 947 | |
| u | 743 | |
| c | 732 | |
| t | 481 | 5.8% |
| i | 470 | 5.6% |
| f | 280 | 3.4% |
| p | 240 | 2.9% |
| Other values (12) | 948 |
typeStatus
Text
Missing 
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 486591 |
| Missing (%) | 80.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 6.818274241 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | PARATYPE |
|---|---|
| 2nd row | TYPE |
| 3rd row | HOLOTYPE |
| 4th row | TYPE |
| 5th row | SYNTYPE |
| Value | Count | Frequency (%) |
| holotype | 53956 | |
| type | 32775 | |
| syntype | 13266 | 11.2% |
| paratype | 11028 | 9.3% |
| lectotype | 5190 | 4.4% |
| allotype | 1078 | 0.9% |
| neotype | 315 | 0.3% |
| cotype | 303 | 0.3% |
| paralectotype | 120 | 0.1% |
| paraneotype | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| Y | 131301 | |
| P | 129186 | |
| E | 123664 | |
| T | 123345 | |
| O | 114923 | |
| L | 61424 | |
| H | 53956 | |
| A | 23381 | 2.9% |
| N | 13585 | 1.7% |
| S | 13266 | 1.6% |
| Other values (2) | 16764 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 804795 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 131301 | |
| P | 129186 | |
| E | 123664 | |
| T | 123345 | |
| O | 114923 | |
| L | 61424 | |
| H | 53956 | |
| A | 23381 | 2.9% |
| N | 13585 | 1.7% |
| S | 13266 | 1.6% |
| Other values (2) | 16764 | 2.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 804795 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| Y | 131301 | |
| P | 129186 | |
| E | 123664 | |
| T | 123345 | |
| O | 114923 | |
| L | 61424 | |
| H | 53956 | |
| A | 23381 | 2.9% |
| N | 13585 | 1.7% |
| S | 13266 | 1.6% |
| Other values (2) | 16764 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 804795 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| Y | 131301 | |
| P | 129186 | |
| E | 123664 | |
| T | 123345 | |
| O | 114923 | |
| L | 61424 | |
| H | 53956 | |
| A | 23381 | 2.9% |
| N | 13585 | 1.7% |
| S | 13266 | 1.6% |
| Other values (2) | 16764 | 2.1% |
identifiedBy
Text
Missing 
| Distinct | 2736 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 454955 |
| Missing (%) | 75.2% |
| Memory size | 4.6 MiB |
Length
| Max length | 150 |
|---|---|
| Median length | 106 |
| Mean length | 27.79390129 |
| Min length | 2 |
Unique
| Unique | 933 ? |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | Westfall, M. J., Jr. |
|---|---|
| 2nd row | Donnelly, Thomas W. |
| 3rd row | Flint, Oliver S., Jr., (ENT), Smithsonian Institution - National Museum of Natural History (UNITED STATES) |
| 4th row | Kormann, K. |
| 5th row | DeMarmels |
| Value | Count | Frequency (%) |
| w | 28127 | 4.4% |
| united | 24410 | 3.8% |
| states | 24409 | 3.8% |
| 22736 | 3.5% | |
| of | 21999 | 3.4% |
| s | 21914 | 3.4% |
| smithsonian | 21909 | 3.4% |
| institution | 21909 | 3.4% |
| museum | 21366 | 3.3% |
| natural | 21088 | 3.3% |
| Other values (2399) | 413039 |
Most occurring characters
| Value | Count | Frequency (%) |
| 493235 | 11.9% | |
| i | 250985 | 6.0% |
| o | 231935 | 5.6% |
| t | 230915 | 5.6% |
| n | 230479 | 5.5% |
| a | 200365 | 4.8% |
| , | 193541 | 4.7% |
| r | 182836 | 4.4% |
| . | 170349 | 4.1% |
| s | 166922 | 4.0% |
| Other values (61) | 1808379 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2295249 | |
| Uppercase Letter | 890430 | 21.4% |
| Space Separator | 493235 | 11.9% |
| Other Punctuation | 364740 | 8.8% |
| Close Punctuation | 46598 | 1.1% |
| Open Punctuation | 46598 | 1.1% |
| Dash Punctuation | 23091 | 0.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 250985 | |
| o | 231935 | |
| t | 230915 | |
| n | 230479 | |
| a | 200365 | |
| r | 182836 | |
| s | 166922 | |
| l | 162341 | |
| e | 157389 | |
| u | 114794 | 5.0% |
| Other values (23) | 366288 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 112733 | |
| S | 105388 | |
| N | 90465 | |
| E | 79705 | 9.0% |
| M | 58701 | 6.6% |
| D | 53037 | 6.0% |
| I | 47394 | 5.3% |
| A | 45385 | 5.1% |
| W | 36642 | 4.1% |
| J | 36226 | 4.1% |
| Other values (16) | 224754 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 193541 | |
| . | 170349 | |
| & | 690 | 0.2% |
| ' | 157 | < 0.1% |
| ; | 2 | < 0.1% |
| ? | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 46596 | |
| ] | 2 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 46596 | |
| [ | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 493235 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 23091 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3185679 | |
| Common | 974262 | 23.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 250985 | 7.9% |
| o | 231935 | 7.3% |
| t | 230915 | 7.2% |
| n | 230479 | 7.2% |
| a | 200365 | 6.3% |
| r | 182836 | 5.7% |
| s | 166922 | 5.2% |
| l | 162341 | 5.1% |
| e | 157389 | 4.9% |
| u | 114794 | 3.6% |
| Other values (49) | 1256718 |
Common
| Value | Count | Frequency (%) |
| 493235 | ||
| , | 193541 | 19.9% |
| . | 170349 | 17.5% |
| ) | 46596 | 4.8% |
| ( | 46596 | 4.8% |
| - | 23091 | 2.4% |
| & | 690 | 0.1% |
| ' | 157 | < 0.1% |
| [ | 2 | < 0.1% |
| ] | 2 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4159904 | |
| None | 37 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 493235 | 11.9% | |
| i | 250985 | 6.0% |
| o | 231935 | 5.6% |
| t | 230915 | 5.6% |
| n | 230479 | 5.5% |
| a | 200365 | 4.8% |
| , | 193541 | 4.7% |
| r | 182836 | 4.4% |
| . | 170349 | 4.1% |
| s | 166922 | 4.0% |
| Other values (54) | 1808342 |
None
| Value | Count | Frequency (%) |
| á | 9 | |
| ń | 9 | |
| ż | 9 | |
| ö | 7 | |
| ü | 1 | 2.7% |
| è | 1 | 2.7% |
| ä | 1 | 2.7% |
identifiedByID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ACCEPTED |
|---|---|
| 2nd row | ACCEPTED |
| Value | Count | Frequency (%) |
| accepted | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 4 | |
| E | 4 | |
| A | 2 | |
| P | 2 | |
| T | 2 | |
| D | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 16 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 4 | |
| E | 4 | |
| A | 2 | |
| P | 2 | |
| T | 2 | |
| D | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 4 | |
| E | 4 | |
| A | 2 | |
| P | 2 | |
| T | 2 | |
| D | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 4 | |
| E | 4 | |
| A | 2 | |
| P | 2 | |
| T | 2 | |
| D | 2 |
identificationVerificationStatus
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 75.0% |
| Missing | 604622 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 22 |
| Mean length | 21.75 |
| Min length | 7 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 50.0% |
Sample
| 1st row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
|---|---|
| 2nd row | 27.7731 |
| 3rd row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 4th row | -4.55006 |
| Value | Count | Frequency (%) |
| 821cc27a-e3bb-4bc5-ac34-89ada245069d | 2 | |
| 27.7731 | 1 | |
| 4.55006 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 9 | |
| c | 8 | 9.2% |
| a | 8 | 9.2% |
| 2 | 7 | 8.0% |
| 4 | 7 | 8.0% |
| b | 6 | 6.9% |
| 5 | 6 | 6.9% |
| 3 | 5 | 5.7% |
| 7 | 5 | 5.7% |
| 9 | 4 | 4.6% |
| Other values (7) | 22 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 48 | |
| Lowercase Letter | 28 | |
| Dash Punctuation | 9 | 10.3% |
| Other Punctuation | 2 | 2.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 7 | |
| 4 | 7 | |
| 5 | 6 | |
| 3 | 5 | |
| 7 | 5 | |
| 9 | 4 | |
| 0 | 4 | |
| 8 | 4 | |
| 1 | 3 | |
| 6 | 3 |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 8 | |
| a | 8 | |
| b | 6 | |
| d | 4 | |
| e | 2 | 7.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 9 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 59 | |
| Latin | 28 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 9 | |
| 2 | 7 | |
| 4 | 7 | |
| 5 | 6 | |
| 3 | 5 | |
| 7 | 5 | |
| 9 | 4 | |
| 0 | 4 | |
| 8 | 4 | |
| 1 | 3 | 5.1% |
| Other values (2) | 5 |
Latin
| Value | Count | Frequency (%) |
| c | 8 | |
| a | 8 | |
| b | 6 | |
| d | 4 | |
| e | 2 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 87 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 9 | |
| c | 8 | 9.2% |
| a | 8 | 9.2% |
| 2 | 7 | 8.0% |
| 4 | 7 | 8.0% |
| b | 6 | 6.9% |
| 5 | 6 | 6.9% |
| 3 | 5 | 5.7% |
| 7 | 5 | 5.7% |
| 9 | 4 | 4.6% |
| Other values (7) | 22 |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 75.0% |
| Missing | 604622 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 4.5 |
| Min length | 2 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 50.0% |
Sample
| 1st row | US |
|---|---|
| 2nd row | -82.64 |
| 3rd row | US |
| 4th row | -76.1874 |
| Value | Count | Frequency (%) |
| us | 2 | |
| 82.64 | 1 | |
| 76.1874 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 2 | |
| S | 2 | |
| - | 2 | |
| 8 | 2 | |
| . | 2 | |
| 6 | 2 | |
| 4 | 2 | |
| 7 | 2 | |
| 2 | 1 | |
| 1 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10 | |
| Uppercase Letter | 4 | 22.2% |
| Dash Punctuation | 2 | 11.1% |
| Other Punctuation | 2 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 2 | |
| 6 | 2 | |
| 4 | 2 | |
| 7 | 2 | |
| 2 | 1 | |
| 1 | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 2 | |
| S | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14 | |
| Latin | 4 | 22.2% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 2 | |
| 8 | 2 | |
| . | 2 | |
| 6 | 2 | |
| 4 | 2 | |
| 7 | 2 | |
| 2 | 1 | |
| 1 | 1 |
Latin
| Value | Count | Frequency (%) |
| U | 2 | |
| S | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 2 | |
| S | 2 | |
| - | 2 | |
| 8 | 2 | |
| . | 2 | |
| 6 | 2 | |
| 4 | 2 | |
| 7 | 2 | |
| 2 | 1 | |
| 1 | 1 |
taxonID
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 24 |
| Min length | 24 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2024-12-02T13:59:16.382Z |
|---|---|
| 2nd row | 2024-12-02T13:59:48.546Z |
| Value | Count | Frequency (%) |
| 2024-12-02t13:59:16.382z | 1 | |
| 2024-12-02t13:59:48.546z | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 9 | |
| 1 | 5 | |
| 0 | 4 | |
| 4 | 4 | |
| - | 4 | |
| : | 4 | |
| 3 | 3 | 6.2% |
| 5 | 3 | 6.2% |
| T | 2 | 4.2% |
| 9 | 2 | 4.2% |
| Other values (4) | 8 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 34 | |
| Other Punctuation | 6 | 12.5% |
| Dash Punctuation | 4 | 8.3% |
| Uppercase Letter | 4 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 9 | |
| 1 | 5 | |
| 0 | 4 | |
| 4 | 4 | |
| 3 | 3 | 8.8% |
| 5 | 3 | 8.8% |
| 9 | 2 | 5.9% |
| 6 | 2 | 5.9% |
| 8 | 2 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 4 | |
| . | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 2 | |
| Z | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 44 | |
| Latin | 4 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 9 | |
| 1 | 5 | |
| 0 | 4 | |
| 4 | 4 | |
| - | 4 | |
| : | 4 | |
| 3 | 3 | 6.8% |
| 5 | 3 | 6.8% |
| 9 | 2 | 4.5% |
| 6 | 2 | 4.5% |
| Other values (2) | 4 |
Latin
| Value | Count | Frequency (%) |
| T | 2 | |
| Z | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 9 | |
| 1 | 5 | |
| 0 | 4 | |
| 4 | 4 | |
| - | 4 | |
| : | 4 | |
| 3 | 3 | 6.2% |
| 5 | 3 | 6.2% |
| T | 2 | 4.2% |
| 9 | 2 | 4.2% |
| Other values (4) | 8 |
| Distinct | 188378 |
|---|---|
| Distinct (%) | 31.4% |
| Missing | 4648 |
| Missing (%) | 0.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.955165023 |
| Min length | 1 |
Unique
| Unique | 134600 ? |
|---|---|
| Unique (%) | 22.4% |
Sample
| 1st row | 7866975 |
|---|---|
| 2nd row | 5122189 |
| 3rd row | 1939887 |
| 4th row | 1422444 |
| 5th row | 4988370 |
| Value | Count | Frequency (%) |
| 1340278 | 10672 | 1.8% |
| 1340525 | 6265 | 1.0% |
| 1340393 | 4073 | 0.7% |
| 10409744 | 3623 | 0.6% |
| 789 | 3466 | 0.6% |
| 1340467 | 3343 | 0.6% |
| 9164 | 3176 | 0.5% |
| 1340350 | 3129 | 0.5% |
| 1341979 | 2431 | 0.4% |
| 1340485 | 2119 | 0.4% |
| Other values (188368) | 557681 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 709890 | |
| 4 | 525521 | |
| 0 | 431132 | |
| 2 | 418685 | |
| 3 | 411620 | |
| 5 | 382598 | |
| 8 | 332590 | |
| 7 | 330905 | |
| 9 | 329832 | |
| 6 | 300173 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4172946 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 709890 | |
| 4 | 525521 | |
| 0 | 431132 | |
| 2 | 418685 | |
| 3 | 411620 | |
| 5 | 382598 | |
| 8 | 332590 | |
| 7 | 330905 | |
| 9 | 329832 | |
| 6 | 300173 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4172946 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 709890 | |
| 4 | 525521 | |
| 0 | 431132 | |
| 2 | 418685 | |
| 3 | 411620 | |
| 5 | 382598 | |
| 8 | 332590 | |
| 7 | 330905 | |
| 9 | 329832 | |
| 6 | 300173 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4172946 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 709890 | |
| 4 | 525521 | |
| 0 | 431132 | |
| 2 | 418685 | |
| 3 | 411620 | |
| 5 | 382598 | |
| 8 | 332590 | |
| 7 | 330905 | |
| 9 | 329832 | |
| 6 | 300173 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 112 |
|---|---|
| Median length | 80 |
| Mean length | 80 |
| Min length | 48 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
|---|---|
| 2nd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84;CONTINENT_DERIVED_FROM_COORDINATES |
| Value | Count | Frequency (%) |
| occurrence_status_inferred_from_individual_count | 1 | |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;continent_derived_from_coordinates | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| _ | 16 | |
| E | 15 | |
| R | 13 | 8.1% |
| I | 12 | 7.5% |
| D | 12 | 7.5% |
| N | 12 | 7.5% |
| T | 11 | 6.9% |
| C | 11 | 6.9% |
| O | 11 | 6.9% |
| U | 10 | 6.2% |
| Other values (11) | 37 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 140 | |
| Connector Punctuation | 16 | 10.0% |
| Other Punctuation | 2 | 1.2% |
| Decimal Number | 2 | 1.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 15 | |
| R | 13 | |
| I | 12 | |
| D | 12 | |
| N | 12 | |
| T | 11 | |
| C | 11 | |
| O | 11 | |
| U | 10 | 7.1% |
| S | 8 | 5.7% |
| Other values (7) | 25 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 1 | |
| 4 | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 16 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 140 | |
| Common | 20 | 12.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 15 | |
| R | 13 | |
| I | 12 | |
| D | 12 | |
| N | 12 | |
| T | 11 | |
| C | 11 | |
| O | 11 | |
| U | 10 | 7.1% |
| S | 8 | 5.7% |
| Other values (7) | 25 |
Common
| Value | Count | Frequency (%) |
| _ | 16 | |
| ; | 2 | 10.0% |
| 8 | 1 | 5.0% |
| 4 | 1 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 160 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| _ | 16 | |
| E | 15 | |
| R | 13 | 8.1% |
| I | 12 | 7.5% |
| D | 12 | 7.5% |
| N | 12 | 7.5% |
| T | 11 | 6.9% |
| C | 11 | 6.9% |
| O | 11 | 6.9% |
| U | 10 | 6.2% |
| Other values (11) | 37 |
taxonConceptID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604625 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | StillImage |
|---|
| Value | Count | Frequency (%) |
| stillimage | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 2 | |
| S | 1 | |
| t | 1 | |
| i | 1 | |
| I | 1 | |
| m | 1 | |
| a | 1 | |
| g | 1 | |
| e | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8 | |
| Uppercase Letter | 2 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 2 | |
| t | 1 | |
| i | 1 | |
| m | 1 | |
| a | 1 | |
| g | 1 | |
| e | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1 | |
| I | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 2 | |
| S | 1 | |
| t | 1 | |
| i | 1 | |
| I | 1 | |
| m | 1 | |
| a | 1 | |
| g | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 2 | |
| S | 1 | |
| t | 1 | |
| i | 1 | |
| I | 1 | |
| m | 1 | |
| a | 1 | |
| g | 1 | |
| e | 1 |
scientificName
Text
| Distinct | 203338 |
|---|---|
| Distinct (%) | 33.6% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 239 |
|---|---|
| Median length | 108 |
| Mean length | 31.31866747 |
| Min length | 4 |
Unique
| Unique | 154758 ? |
|---|---|
| Unique (%) | 25.6% |
Sample
| 1st row | Camponotus rufoglaucus var. rufigenis Forel |
|---|---|
| 2nd row | Athrips mesoleuca Lower, 1900 |
| 3rd row | Paranthrene asilipennis (Boisduval, 1832) |
| 4th row | Acanthagrion trilobatum Leonard, 1977 |
| 5th row | Calathus nanulus Casey, 1920 |
| Value | Count | Frequency (%) |
| bombus | 62365 | 2.7% |
| 29343 | 1.3% | |
| hagen | 24881 | 1.1% |
| cresson | 24121 | 1.0% |
| 1861 | 19352 | 0.8% |
| fabricius | 16608 | 0.7% |
| 1863 | 16510 | 0.7% |
| selys | 15944 | 0.7% |
| casey | 15917 | 0.7% |
| latreille | 15270 | 0.7% |
| Other values (119252) | 2103492 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1739179 | 9.2% | |
| a | 1483659 | 7.8% |
| e | 1211161 | 6.4% |
| i | 1149549 | 6.1% |
| s | 1059970 | 5.6% |
| r | 959396 | 5.1% |
| o | 891386 | 4.7% |
| l | 793140 | 4.2% |
| n | 766475 | 4.0% |
| 1 | 670618 | 3.5% |
| Other values (99) | 8211485 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12719422 | |
| Decimal Number | 2282464 | 12.1% |
| Space Separator | 1739179 | 9.2% |
| Uppercase Letter | 1240430 | 6.6% |
| Other Punctuation | 615505 | 3.3% |
| Close Punctuation | 167008 | 0.9% |
| Open Punctuation | 167008 | 0.9% |
| Dash Punctuation | 5002 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1483659 | |
| e | 1211161 | 9.5% |
| i | 1149549 | 9.0% |
| s | 1059970 | 8.3% |
| r | 959396 | 7.5% |
| o | 891386 | 7.0% |
| l | 793140 | 6.2% |
| n | 766475 | 6.0% |
| u | 640369 | 5.0% |
| t | 634331 | 5.0% |
| Other values (47) | 3129986 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 149154 | |
| B | 127924 | 10.3% |
| S | 115986 | 9.4% |
| P | 89895 | 7.2% |
| A | 87131 | 7.0% |
| H | 83287 | 6.7% |
| L | 83192 | 6.7% |
| M | 63745 | 5.1% |
| D | 56317 | 4.5% |
| E | 50255 | 4.1% |
| Other values (23) | 333544 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 670618 | |
| 8 | 384509 | |
| 9 | 327352 | |
| 7 | 165872 | 7.3% |
| 0 | 131673 | 5.8% |
| 6 | 131257 | 5.8% |
| 3 | 128731 | 5.6% |
| 2 | 126306 | 5.5% |
| 5 | 111567 | 4.9% |
| 4 | 104579 | 4.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 572531 | |
| & | 29332 | 4.8% |
| . | 13436 | 2.2% |
| ' | 195 | < 0.1% |
| ? | 11 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1739179 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 167008 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 167008 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5002 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13959852 | |
| Common | 4976166 | 26.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1483659 | 10.6% |
| e | 1211161 | 8.7% |
| i | 1149549 | 8.2% |
| s | 1059970 | 7.6% |
| r | 959396 | 6.9% |
| o | 891386 | 6.4% |
| l | 793140 | 5.7% |
| n | 766475 | 5.5% |
| u | 640369 | 4.6% |
| t | 634331 | 4.5% |
| Other values (80) | 4370416 |
Common
| Value | Count | Frequency (%) |
| 1739179 | ||
| 1 | 670618 | 13.5% |
| , | 572531 | 11.5% |
| 8 | 384509 | 7.7% |
| 9 | 327352 | 6.6% |
| ) | 167008 | 3.4% |
| ( | 167008 | 3.4% |
| 7 | 165872 | 3.3% |
| 0 | 131673 | 2.6% |
| 6 | 131257 | 2.6% |
| Other values (9) | 519159 | 10.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18911224 | |
| None | 24794 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1739179 | 9.2% | |
| a | 1483659 | 7.8% |
| e | 1211161 | 6.4% |
| i | 1149549 | 6.1% |
| s | 1059970 | 5.6% |
| r | 959396 | 5.1% |
| o | 891386 | 4.7% |
| l | 793140 | 4.2% |
| n | 766475 | 4.1% |
| 1 | 670618 | 3.5% |
| Other values (61) | 8186691 |
None
| Value | Count | Frequency (%) |
| é | 9434 | |
| ü | 5136 | |
| ö | 3299 | 13.3% |
| å | 1792 | 7.2% |
| á | 1363 | 5.5% |
| ä | 1221 | 4.9% |
| ç | 858 | 3.5% |
| è | 790 | 3.2% |
| ó | 219 | 0.9% |
| í | 140 | 0.6% |
| Other values (28) | 542 | 2.2% |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| Value | Count | Frequency (%) |
| false | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 2 | |
| a | 2 | |
| l | 2 | |
| s | 2 | |
| e | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 2 | |
| a | 2 | |
| l | 2 | |
| s | 2 | |
| e | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| f | 2 | |
| a | 2 | |
| l | 2 | |
| s | 2 | |
| e | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| f | 2 | |
| a | 2 | |
| l | 2 | |
| s | 2 | |
| e | 2 |
parentNameUsage
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604623 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 7 |
| Mean length | 8.666666667 |
| Min length | 7 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1937011 |
|---|---|
| 2nd row | Google Earth |
| 3rd row | 1424710 |
| Value | Count | Frequency (%) |
| 1937011 | 1 | |
| 1 | ||
| earth | 1 | |
| 1424710 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 4 | 2 | 7.7% |
| 7 | 2 | 7.7% |
| 0 | 2 | 7.7% |
| o | 2 | 7.7% |
| E | 1 | 3.8% |
| h | 1 | 3.8% |
| t | 1 | 3.8% |
| r | 1 | 3.8% |
| a | 1 | 3.8% |
| Other values (8) | 8 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14 | |
| Lowercase Letter | 9 | |
| Uppercase Letter | 2 | 7.7% |
| Space Separator | 1 | 3.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 2 | |
| h | 1 | |
| t | 1 | |
| r | 1 | |
| a | 1 | |
| e | 1 | |
| l | 1 | |
| g | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 4 | 2 | 14.3% |
| 7 | 2 | 14.3% |
| 0 | 2 | 14.3% |
| 9 | 1 | 7.1% |
| 3 | 1 | 7.1% |
| 2 | 1 | 7.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1 | |
| G | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15 | |
| Latin | 11 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 2 | |
| E | 1 | |
| h | 1 | |
| t | 1 | |
| r | 1 | |
| a | 1 | |
| e | 1 | |
| l | 1 | |
| g | 1 | |
| G | 1 |
Common
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 4 | 2 | 13.3% |
| 7 | 2 | 13.3% |
| 0 | 2 | 13.3% |
| 1 | 6.7% | |
| 9 | 1 | 6.7% |
| 3 | 1 | 6.7% |
| 2 | 1 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 4 | 2 | 7.7% |
| 7 | 2 | 7.7% |
| 0 | 2 | 7.7% |
| o | 2 | 7.7% |
| E | 1 | 3.8% |
| h | 1 | 3.8% |
| t | 1 | 3.8% |
| r | 1 | 3.8% |
| a | 1 | 3.8% |
| Other values (8) | 8 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1937011 |
|---|---|
| 2nd row | 1424710 |
| Value | Count | Frequency (%) |
| 1937011 | 1 | |
| 1424710 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 7 | 2 | 14.3% |
| 0 | 2 | 14.3% |
| 4 | 2 | 14.3% |
| 9 | 1 | 7.1% |
| 3 | 1 | 7.1% |
| 2 | 1 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 7 | 2 | 14.3% |
| 0 | 2 | 14.3% |
| 4 | 2 | 14.3% |
| 9 | 1 | 7.1% |
| 3 | 1 | 7.1% |
| 2 | 1 | 7.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 7 | 2 | 14.3% |
| 0 | 2 | 14.3% |
| 4 | 2 | 14.3% |
| 9 | 1 | 7.1% |
| 3 | 1 | 7.1% |
| 2 | 1 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 7 | 2 | 14.3% |
| 0 | 2 | 14.3% |
| 4 | 2 | 14.3% |
| 9 | 1 | 7.1% |
| 3 | 1 | 7.1% |
| 2 | 1 | 7.1% |
nameAccordingTo
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| Value | Count | Frequency (%) |
| 1 | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2 |
namePublishedIn
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 54 |
|---|---|
| 2nd row | 54 |
| Value | Count | Frequency (%) |
| 54 | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 2 | |
| 4 | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 2 | |
| 4 | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 2 | |
| 4 | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 2 | |
| 4 | 2 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 216 |
|---|---|
| 2nd row | 216 |
| Value | Count | Frequency (%) |
| 216 | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 1 | 2 | |
| 6 | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 1 | 2 | |
| 6 | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 1 | 2 | |
| 6 | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 1 | 2 | |
| 6 | 2 |
| Distinct | 3456 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 4647 |
| Missing (%) | 0.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 97 |
|---|---|
| Median length | 91 |
| Mean length | 62.39093868 |
| Min length | 3 |
Unique
| Unique | 577 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Animalia, Arthropoda, Insecta, Hymenoptera, Formicidae, Formicinae |
|---|---|
| 2nd row | Animalia, Arthropoda, Insecta, Lepidoptera, Gelechiidae, Gelechiinae |
| 3rd row | Animalia, Arthropoda, Insecta, Lepidoptera, Sesiidae, Sesiinae |
| 4th row | Animalia, Arthropoda, Insecta, Odonata, Zygoptera, Coenagrionidae |
| 5th row | Animalia, Arthropoda, Insecta, Coleoptera, Carabidae |
| Value | Count | Frequency (%) |
| arthropoda | 599697 | |
| animalia | 598328 | |
| insecta | 587915 | |
| hymenoptera | 146500 | 4.2% |
| odonata | 117281 | 3.4% |
| lepidoptera | 99941 | 2.9% |
| apidae | 82932 | 2.4% |
| diptera | 73535 | 2.1% |
| coleoptera | 72078 | 2.1% |
| apinae | 63521 | 1.8% |
| Other values (2938) | 1026036 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4570771 | |
| e | 2938293 | 7.8% |
| 2867785 | 7.7% | |
| , | 2867419 | 7.7% |
| i | 2865053 | 7.7% |
| o | 2432895 | 6.5% |
| r | 2316840 | 6.2% |
| t | 2192044 | 5.9% |
| n | 2160053 | 5.8% |
| p | 1690137 | 4.5% |
| Other values (53) | 10531963 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 28230630 | |
| Uppercase Letter | 3467330 | 9.3% |
| Space Separator | 2867785 | 7.7% |
| Other Punctuation | 2867488 | 7.7% |
| Decimal Number | 16 | < 0.1% |
| Connector Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4570771 | |
| e | 2938293 | |
| i | 2865053 | |
| o | 2432895 | |
| r | 2316840 | |
| t | 2192044 | |
| n | 2160053 | |
| p | 1690137 | 6.0% |
| d | 1537742 | 5.4% |
| l | 1127926 | 4.0% |
| Other values (16) | 4398876 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1474018 | |
| I | 598175 | |
| C | 245213 | 7.1% |
| H | 231711 | 6.7% |
| L | 182525 | 5.3% |
| O | 125491 | 3.6% |
| P | 113908 | 3.3% |
| D | 95369 | 2.8% |
| S | 80616 | 2.3% |
| Z | 57602 | 1.7% |
| Other values (15) | 262702 | 7.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 3 | |
| 7 | 3 | |
| 9 | 3 | |
| 0 | 2 | |
| 1 | 2 | |
| 3 | 2 | |
| 8 | 1 | 6.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2867419 | |
| ? | 39 | < 0.1% |
| / | 30 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2867785 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 31697960 | |
| Common | 5735293 | 15.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4570771 | |
| e | 2938293 | 9.3% |
| i | 2865053 | 9.0% |
| o | 2432895 | 7.7% |
| r | 2316840 | 7.3% |
| t | 2192044 | 6.9% |
| n | 2160053 | 6.8% |
| p | 1690137 | 5.3% |
| d | 1537742 | 4.9% |
| A | 1474018 | 4.7% |
| Other values (41) | 7520114 |
Common
| Value | Count | Frequency (%) |
| 2867785 | ||
| , | 2867419 | |
| ? | 39 | < 0.1% |
| / | 30 | < 0.1% |
| _ | 4 | < 0.1% |
| 6 | 3 | < 0.1% |
| 7 | 3 | < 0.1% |
| 9 | 3 | < 0.1% |
| 0 | 2 | < 0.1% |
| 1 | 2 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 37433253 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4570771 | |
| e | 2938293 | 7.8% |
| 2867785 | 7.7% | |
| , | 2867419 | 7.7% |
| i | 2865053 | 7.7% |
| o | 2432895 | 6.5% |
| r | 2316840 | 6.2% |
| t | 2192044 | 5.9% |
| n | 2160053 | 5.8% |
| p | 1690137 | 4.5% |
| Other values (53) | 10531963 |
kingdom
Text
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 8 |
| Mean length | 8.046071608 |
| Min length | 4 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Animalia |
|---|---|
| 2nd row | Animalia |
| 3rd row | Animalia |
| 4th row | Animalia |
| 5th row | Animalia |
| Value | Count | Frequency (%) |
| animalia | 599978 | |
| incertae | 4644 | 0.8% |
| sedis | 4644 | 0.8% |
| 9417 | 1 | < 0.1% |
| 4209 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 1209244 | |
| a | 1204600 | |
| n | 604622 | |
| A | 599978 | |
| m | 599978 | |
| l | 599978 | |
| e | 13932 | 0.3% |
| s | 9288 | 0.2% |
| 4644 | 0.1% | |
| d | 4644 | 0.1% |
| Other values (9) | 13940 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4260218 | |
| Uppercase Letter | 599978 | 12.3% |
| Space Separator | 4644 | 0.1% |
| Decimal Number | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1209244 | |
| a | 1204600 | |
| n | 604622 | |
| m | 599978 | |
| l | 599978 | |
| e | 13932 | 0.3% |
| s | 9288 | 0.2% |
| d | 4644 | 0.1% |
| t | 4644 | 0.1% |
| r | 4644 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 2 | |
| 4 | 2 | |
| 1 | 1 | |
| 7 | 1 | |
| 2 | 1 | |
| 0 | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 599978 |
Space Separator
| Value | Count | Frequency (%) |
| 4644 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4860196 | |
| Common | 4652 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 1209244 | |
| a | 1204600 | |
| n | 604622 | |
| A | 599978 | |
| m | 599978 | |
| l | 599978 | |
| e | 13932 | 0.3% |
| s | 9288 | 0.2% |
| d | 4644 | 0.1% |
| t | 4644 | 0.1% |
| Other values (2) | 9288 | 0.2% |
Common
| Value | Count | Frequency (%) |
| 4644 | ||
| 9 | 2 | < 0.1% |
| 4 | 2 | < 0.1% |
| 1 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| 0 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4864848 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 1209244 | |
| a | 1204600 | |
| n | 604622 | |
| A | 599978 | |
| m | 599978 | |
| l | 599978 | |
| e | 13932 | 0.3% |
| s | 9288 | 0.2% |
| 4644 | 0.1% | |
| d | 4644 | 0.1% |
| Other values (9) | 13940 | 0.3% |
phylum
Text
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5245 |
| Missing (%) | 0.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 9.999918249 |
| Min length | 7 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Arthropoda |
|---|---|
| 2nd row | Arthropoda |
| 3rd row | Arthropoda |
| 4th row | Arthropoda |
| 5th row | Arthropoda |
| Value | Count | Frequency (%) |
| arthropoda | 599346 | |
| cnidaria | 18 | < 0.1% |
| onychophora | 6 | < 0.1% |
| mollusca | 5 | < 0.1% |
| chordata | 2 | < 0.1% |
| 1936987 | 1 | < 0.1% |
| nemertea | 1 | < 0.1% |
| 1424684 | 1 | < 0.1% |
| echinodermata | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 1198720 | |
| o | 1198712 | |
| a | 599400 | |
| d | 599367 | |
| h | 599361 | |
| p | 599352 | |
| t | 599350 | |
| A | 599346 | |
| i | 37 | < 0.1% |
| n | 25 | < 0.1% |
| Other values (20) | 91 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5394368 | |
| Uppercase Letter | 599379 | 10.0% |
| Decimal Number | 14 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 1198720 | |
| o | 1198712 | |
| a | 599400 | |
| d | 599367 | |
| h | 599361 | |
| p | 599352 | |
| t | 599350 | |
| i | 37 | < 0.1% |
| n | 25 | < 0.1% |
| c | 12 | < 0.1% |
| Other values (6) | 32 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 3 | |
| 8 | 2 | |
| 6 | 2 | |
| 9 | 2 | |
| 1 | 2 | |
| 7 | 1 | 7.1% |
| 3 | 1 | 7.1% |
| 2 | 1 | 7.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 599346 | |
| C | 20 | < 0.1% |
| O | 6 | < 0.1% |
| M | 5 | < 0.1% |
| N | 1 | < 0.1% |
| E | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5993747 | |
| Common | 14 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 1198720 | |
| o | 1198712 | |
| a | 599400 | |
| d | 599367 | |
| h | 599361 | |
| p | 599352 | |
| t | 599350 | |
| A | 599346 | |
| i | 37 | < 0.1% |
| n | 25 | < 0.1% |
| Other values (12) | 77 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 4 | 3 | |
| 8 | 2 | |
| 6 | 2 | |
| 9 | 2 | |
| 1 | 2 | |
| 7 | 1 | 7.1% |
| 3 | 1 | 7.1% |
| 2 | 1 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5993761 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 1198720 | |
| o | 1198712 | |
| a | 599400 | |
| d | 599367 | |
| h | 599361 | |
| p | 599352 | |
| t | 599350 | |
| A | 599346 | |
| i | 37 | < 0.1% |
| n | 25 | < 0.1% |
| Other values (20) | 91 | < 0.1% |
class
Text
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5283 |
| Missing (%) | 0.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 7 |
| Mean length | 7.038410393 |
| Min length | 7 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Insecta |
|---|---|
| 2nd row | Insecta |
| 3rd row | Insecta |
| 4th row | Insecta |
| 5th row | Insecta |
| Value | Count | Frequency (%) |
| insecta | 588111 | |
| arachnida | 7917 | 1.3% |
| diplopoda | 1599 | 0.3% |
| collembola | 820 | 0.1% |
| chilopoda | 736 | 0.1% |
| diplura | 77 | < 0.1% |
| protura | 62 | < 0.1% |
| symphyla | 8 | < 0.1% |
| malacostraca | 5 | < 0.1% |
| pauropoda | 4 | < 0.1% |
| Other values (3) | 4 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 607282 | |
| c | 596040 | |
| n | 596030 | |
| e | 588932 | |
| t | 588180 | |
| s | 588119 | |
| I | 588111 | |
| i | 10333 | 0.2% |
| d | 10259 | 0.2% |
| h | 8663 | 0.2% |
| Other values (16) | 36473 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3619079 | |
| Uppercase Letter | 599343 | 14.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 607282 | |
| c | 596040 | |
| n | 596030 | |
| e | 588932 | |
| t | 588180 | |
| s | 588119 | |
| i | 10333 | 0.3% |
| d | 10259 | 0.3% |
| h | 8663 | 0.2% |
| r | 8130 | 0.2% |
| Other values (7) | 17111 | 0.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 588111 | |
| A | 7917 | 1.3% |
| D | 1676 | 0.3% |
| C | 1556 | 0.3% |
| P | 66 | < 0.1% |
| S | 8 | < 0.1% |
| M | 5 | < 0.1% |
| G | 2 | < 0.1% |
| E | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4218422 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 607282 | |
| c | 596040 | |
| n | 596030 | |
| e | 588932 | |
| t | 588180 | |
| s | 588119 | |
| I | 588111 | |
| i | 10333 | 0.2% |
| d | 10259 | 0.2% |
| h | 8663 | 0.2% |
| Other values (16) | 36473 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4218422 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 607282 | |
| c | 596040 | |
| n | 596030 | |
| e | 588932 | |
| t | 588180 | |
| s | 588119 | |
| I | 588111 | |
| i | 10333 | 0.2% |
| d | 10259 | 0.2% |
| h | 8663 | 0.2% |
| Other values (16) | 36473 | 0.9% |
order
Text
| Distinct | 74 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5577 |
| Missing (%) | 0.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 16 |
| Mean length | 9.451483935 |
| Min length | 6 |
Unique
| Unique | 8 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Hymenoptera |
|---|---|
| 2nd row | Lepidoptera |
| 3rd row | Lepidoptera |
| 4th row | Odonata |
| 5th row | Coleoptera |
| Value | Count | Frequency (%) |
| hymenoptera | 146330 | |
| odonata | 117284 | |
| lepidoptera | 99491 | |
| diptera | 73566 | |
| coleoptera | 71961 | |
| hemiptera | 37757 | 6.3% |
| siphonaptera | 10087 | 1.7% |
| trichoptera | 9104 | 1.5% |
| thysanoptera | 4628 | 0.8% |
| araneae | 4624 | 0.8% |
| Other values (64) | 24217 | 4.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 849733 | |
| a | 742019 | |
| t | 591304 | |
| p | 577797 | |
| o | 563778 | |
| r | 489265 | |
| n | 284180 | 5.0% |
| i | 238281 | 4.2% |
| d | 228781 | 4.0% |
| m | 192363 | 3.4% |
| Other values (37) | 904401 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5062841 | |
| Uppercase Letter | 599047 | 10.6% |
| Decimal Number | 14 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 849733 | |
| a | 742019 | |
| t | 591304 | |
| p | 577797 | |
| o | 563778 | |
| r | 489265 | |
| n | 284180 | 5.6% |
| i | 238281 | 4.7% |
| d | 228781 | 4.5% |
| m | 192363 | 3.8% |
| Other values (12) | 305340 | 6.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 184087 | |
| O | 118498 | |
| L | 99813 | |
| D | 73747 | |
| C | 72099 | 12.0% |
| T | 15063 | 2.5% |
| S | 11901 | 2.0% |
| P | 7210 | 1.2% |
| M | 4867 | 0.8% |
| A | 4742 | 0.8% |
| Other values (8) | 7020 | 1.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 7 | 2 | 14.3% |
| 0 | 2 | 14.3% |
| 4 | 2 | 14.3% |
| 9 | 1 | 7.1% |
| 3 | 1 | 7.1% |
| 2 | 1 | 7.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5661888 | |
| Common | 14 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 849733 | |
| a | 742019 | |
| t | 591304 | |
| p | 577797 | |
| o | 563778 | |
| r | 489265 | |
| n | 284180 | 5.0% |
| i | 238281 | 4.2% |
| d | 228781 | 4.0% |
| m | 192363 | 3.4% |
| Other values (30) | 904387 |
Common
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 7 | 2 | 14.3% |
| 0 | 2 | 14.3% |
| 4 | 2 | 14.3% |
| 9 | 1 | 7.1% |
| 3 | 1 | 7.1% |
| 2 | 1 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5661902 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 849733 | |
| a | 742019 | |
| t | 591304 | |
| p | 577797 | |
| o | 563778 | |
| r | 489265 | |
| n | 284180 | 5.0% |
| i | 238281 | 4.2% |
| d | 228781 | 4.0% |
| m | 192363 | 3.4% |
| Other values (37) | 904401 |
superfamily
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 19.5 |
| Mean length | 19.5 |
| Min length | 17 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Troides amphrysus |
|---|---|
| 2nd row | Gynacantha membranalis |
| Value | Count | Frequency (%) |
| troides | 1 | |
| amphrysus | 1 | |
| gynacantha | 1 | |
| membranalis | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 6 | |
| s | 4 | 10.3% |
| n | 3 | 7.7% |
| m | 3 | 7.7% |
| r | 3 | 7.7% |
| i | 2 | 5.1% |
| e | 2 | 5.1% |
| 2 | 5.1% | |
| h | 2 | 5.1% |
| y | 2 | 5.1% |
| Other values (10) | 10 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 35 | |
| Space Separator | 2 | 5.1% |
| Uppercase Letter | 2 | 5.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6 | |
| s | 4 | |
| n | 3 | |
| m | 3 | |
| r | 3 | |
| i | 2 | 5.7% |
| e | 2 | 5.7% |
| h | 2 | 5.7% |
| y | 2 | 5.7% |
| b | 1 | 2.9% |
| Other values (7) | 7 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1 | |
| G | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 37 | |
| Common | 2 | 5.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6 | |
| s | 4 | |
| n | 3 | 8.1% |
| m | 3 | 8.1% |
| r | 3 | 8.1% |
| i | 2 | 5.4% |
| e | 2 | 5.4% |
| h | 2 | 5.4% |
| y | 2 | 5.4% |
| T | 1 | 2.7% |
| Other values (9) | 9 |
Common
| Value | Count | Frequency (%) |
| 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 39 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 6 | |
| s | 4 | 10.3% |
| n | 3 | 7.7% |
| m | 3 | 7.7% |
| r | 3 | 7.7% |
| i | 2 | 5.1% |
| e | 2 | 5.1% |
| 2 | 5.1% | |
| h | 2 | 5.1% |
| y | 2 | 5.1% |
| Other values (10) | 10 |
family
Text
Missing 
| Distinct | 1494 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 11642 |
| Missing (%) | 1.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 21 |
| Mean length | 10.49803873 |
| Min length | 6 |
Unique
| Unique | 196 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Formicidae |
|---|---|
| 2nd row | Gelechiidae |
| 3rd row | Sesiidae |
| 4th row | Coenagrionidae |
| 5th row | Carabidae |
| Value | Count | Frequency (%) |
| apidae | 82646 | 13.9% |
| libellulidae | 42503 | 7.2% |
| coenagrionidae | 36255 | 6.1% |
| chrysomelidae | 17448 | 2.9% |
| crambidae | 13614 | 2.3% |
| asilidae | 13374 | 2.3% |
| geometridae | 12793 | 2.2% |
| psychodidae | 11788 | 2.0% |
| curculionidae | 11689 | 2.0% |
| formicidae | 9878 | 1.7% |
| Other values (1490) | 341002 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 902858 | |
| e | 876852 | |
| a | 817696 | |
| d | 657115 | |
| o | 322939 | 5.2% |
| l | 317031 | 5.1% |
| r | 285013 | 4.6% |
| p | 208767 | 3.4% |
| n | 202426 | 3.3% |
| h | 150237 | 2.4% |
| Other values (50) | 1484235 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5632165 | |
| Uppercase Letter | 592986 | 9.5% |
| Decimal Number | 8 | < 0.1% |
| Space Separator | 6 | < 0.1% |
| Other Punctuation | 2 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 902858 | |
| e | 876852 | |
| a | 817696 | |
| d | 657115 | |
| o | 322939 | 5.7% |
| l | 317031 | 5.6% |
| r | 285013 | 5.1% |
| p | 208767 | 3.7% |
| n | 202426 | 3.6% |
| h | 150237 | 2.7% |
| Other values (16) | 891231 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 138628 | |
| A | 122946 | |
| L | 65061 | |
| P | 58656 | |
| T | 31926 | 5.4% |
| S | 31919 | 5.4% |
| G | 26736 | 4.5% |
| E | 18009 | 3.0% |
| M | 16962 | 2.9% |
| N | 16555 | 2.8% |
| Other values (16) | 65588 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3 | |
| 9 | 2 | |
| 7 | 2 | |
| 8 | 1 | 12.5% |
Space Separator
| Value | Count | Frequency (%) |
| 6 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6225151 | |
| Common | 18 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 902858 | |
| e | 876852 | |
| a | 817696 | |
| d | 657115 | |
| o | 322939 | 5.2% |
| l | 317031 | 5.1% |
| r | 285013 | 4.6% |
| p | 208767 | 3.4% |
| n | 202426 | 3.3% |
| h | 150237 | 2.4% |
| Other values (42) | 1484217 |
Common
| Value | Count | Frequency (%) |
| 6 | ||
| 1 | 3 | |
| , | 2 | 11.1% |
| 9 | 2 | 11.1% |
| 7 | 2 | 11.1% |
| 8 | 1 | 5.6% |
| ( | 1 | 5.6% |
| ) | 1 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6225169 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 902858 | |
| e | 876852 | |
| a | 817696 | |
| d | 657115 | |
| o | 322939 | 5.2% |
| l | 317031 | 5.1% |
| r | 285013 | 4.6% |
| p | 208767 | 3.4% |
| n | 202426 | 3.3% |
| h | 150237 | 2.4% |
| Other values (50) | 1484235 |
subfamily
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 19.5 |
| Mean length | 19.5 |
| Min length | 17 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Troides amphrysus |
|---|---|
| 2nd row | Gynacantha membranalis |
| Value | Count | Frequency (%) |
| troides | 1 | |
| amphrysus | 1 | |
| gynacantha | 1 | |
| membranalis | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 6 | |
| s | 4 | 10.3% |
| n | 3 | 7.7% |
| m | 3 | 7.7% |
| r | 3 | 7.7% |
| i | 2 | 5.1% |
| e | 2 | 5.1% |
| 2 | 5.1% | |
| h | 2 | 5.1% |
| y | 2 | 5.1% |
| Other values (10) | 10 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 35 | |
| Space Separator | 2 | 5.1% |
| Uppercase Letter | 2 | 5.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6 | |
| s | 4 | |
| n | 3 | |
| m | 3 | |
| r | 3 | |
| i | 2 | 5.7% |
| e | 2 | 5.7% |
| h | 2 | 5.7% |
| y | 2 | 5.7% |
| b | 1 | 2.9% |
| Other values (7) | 7 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1 | |
| G | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 37 | |
| Common | 2 | 5.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6 | |
| s | 4 | |
| n | 3 | 8.1% |
| m | 3 | 8.1% |
| r | 3 | 8.1% |
| i | 2 | 5.4% |
| e | 2 | 5.4% |
| h | 2 | 5.4% |
| y | 2 | 5.4% |
| T | 1 | 2.7% |
| Other values (9) | 9 |
Common
| Value | Count | Frequency (%) |
| 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 39 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 6 | |
| s | 4 | 10.3% |
| n | 3 | 7.7% |
| m | 3 | 7.7% |
| r | 3 | 7.7% |
| i | 2 | 5.1% |
| e | 2 | 5.1% |
| 2 | 5.1% | |
| h | 2 | 5.1% |
| y | 2 | 5.1% |
| Other values (10) | 10 |
subtribe
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EML |
|---|---|
| 2nd row | EML |
| Value | Count | Frequency (%) |
| eml | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 2 | |
| M | 2 | |
| L | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 2 | |
| M | 2 | |
| L | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 2 | |
| M | 2 | |
| L | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 2 | |
| M | 2 | |
| L | 2 |
genus
Text
Missing 
| Distinct | 35722 |
|---|---|
| Distinct (%) | 6.1% |
| Missing | 19883 |
| Missing (%) | 3.3% |
| Memory size | 4.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 19 |
| Mean length | 8.97094279 |
| Min length | 3 |
Unique
| Unique | 11794 ? |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | Camponotus |
|---|---|
| 2nd row | Athrips |
| 3rd row | Paranthrene |
| 4th row | Acanthagrion |
| 5th row | Calathus |
| Value | Count | Frequency (%) |
| bombus | 62386 | 10.7% |
| xylocopa | 11739 | 2.0% |
| argia | 8660 | 1.5% |
| enallagma | 7903 | 1.4% |
| crambus | 7885 | 1.3% |
| ischnura | 7465 | 1.3% |
| sympetrum | 6026 | 1.0% |
| apis | 4967 | 0.8% |
| erythrodiplax | 4175 | 0.7% |
| lestes | 4149 | 0.7% |
| Other values (35712) | 459388 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 535407 | 10.2% |
| o | 472261 | 9.0% |
| s | 396292 | 7.6% |
| i | 368556 | 7.0% |
| e | 354889 | 6.8% |
| r | 324058 | 6.2% |
| l | 257744 | 4.9% |
| u | 248449 | 4.7% |
| t | 231309 | 4.4% |
| m | 228883 | 4.4% |
| Other values (54) | 1827848 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4660907 | |
| Uppercase Letter | 584745 | 11.1% |
| Decimal Number | 34 | < 0.1% |
| Other Punctuation | 6 | < 0.1% |
| Dash Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 535407 | |
| o | 472261 | 10.1% |
| s | 396292 | 8.5% |
| i | 368556 | 7.9% |
| e | 354889 | 7.6% |
| r | 324058 | 7.0% |
| l | 257744 | 5.5% |
| u | 248449 | 5.3% |
| t | 231309 | 5.0% |
| m | 228883 | 4.9% |
| Other values (16) | 1243059 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 76807 | |
| P | 69931 | |
| A | 64651 | |
| C | 63912 | |
| E | 41054 | 7.0% |
| S | 37255 | 6.4% |
| L | 29087 | 5.0% |
| H | 28222 | 4.8% |
| M | 27165 | 4.6% |
| T | 26554 | 4.5% |
| Other values (16) | 120107 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 9 | |
| 1 | 5 | |
| 0 | 4 | |
| 4 | 4 | |
| 3 | 3 | 8.8% |
| 5 | 3 | 8.8% |
| 9 | 2 | 5.9% |
| 8 | 2 | 5.9% |
| 6 | 2 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 4 | |
| . | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5245652 | |
| Common | 44 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 535407 | 10.2% |
| o | 472261 | 9.0% |
| s | 396292 | 7.6% |
| i | 368556 | 7.0% |
| e | 354889 | 6.8% |
| r | 324058 | 6.2% |
| l | 257744 | 4.9% |
| u | 248449 | 4.7% |
| t | 231309 | 4.4% |
| m | 228883 | 4.4% |
| Other values (42) | 1827804 |
Common
| Value | Count | Frequency (%) |
| 2 | 9 | |
| 1 | 5 | |
| 0 | 4 | |
| 4 | 4 | |
| - | 4 | |
| : | 4 | |
| 3 | 3 | 6.8% |
| 5 | 3 | 6.8% |
| 9 | 2 | 4.5% |
| 8 | 2 | 4.5% |
| Other values (2) | 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5245696 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 535407 | 10.2% |
| o | 472261 | 9.0% |
| s | 396292 | 7.6% |
| i | 368556 | 7.0% |
| e | 354889 | 6.8% |
| r | 324058 | 6.2% |
| l | 257744 | 4.9% |
| u | 248449 | 4.7% |
| t | 231309 | 4.4% |
| m | 228883 | 4.4% |
| Other values (54) | 1827848 |
genericName
Text
Missing 
| Distinct | 38103 |
|---|---|
| Distinct (%) | 6.5% |
| Missing | 19882 |
| Missing (%) | 3.3% |
| Memory size | 4.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 19 |
| Mean length | 8.918990191 |
| Min length | 1 |
Unique
| Unique | 13468 ? |
|---|---|
| Unique (%) | 2.3% |
Sample
| 1st row | Camponotus |
|---|---|
| 2nd row | Athrips |
| 3rd row | Paranthrene |
| 4th row | Acanthagrion |
| 5th row | Calathus |
| Value | Count | Frequency (%) |
| bombus | 62365 | 10.7% |
| xylocopa | 11743 | 2.0% |
| argia | 8660 | 1.5% |
| enallagma | 7977 | 1.4% |
| crambus | 7970 | 1.4% |
| ischnura | 7456 | 1.3% |
| sympetrum | 6028 | 1.0% |
| apis | 4968 | 0.8% |
| lestes | 4235 | 0.7% |
| erythrodiplax | 4175 | 0.7% |
| Other values (38093) | 459167 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 528684 | 10.1% |
| o | 470494 | 9.0% |
| s | 396333 | 7.6% |
| i | 366131 | 7.0% |
| e | 352590 | 6.8% |
| r | 320159 | 6.1% |
| l | 255087 | 4.9% |
| u | 247647 | 4.7% |
| m | 230840 | 4.4% |
| t | 230398 | 4.4% |
| Other values (55) | 1816963 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4630536 | |
| Uppercase Letter | 584735 | 11.2% |
| Decimal Number | 34 | < 0.1% |
| Other Punctuation | 17 | < 0.1% |
| Dash Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 528684 | |
| o | 470494 | 10.2% |
| s | 396333 | 8.6% |
| i | 366131 | 7.9% |
| e | 352590 | 7.6% |
| r | 320159 | 6.9% |
| l | 255087 | 5.5% |
| u | 247647 | 5.3% |
| m | 230840 | 5.0% |
| t | 230398 | 5.0% |
| Other values (18) | 1232173 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 76873 | |
| P | 68862 | |
| A | 65541 | |
| C | 63934 | |
| E | 40426 | 6.9% |
| S | 36934 | 6.3% |
| L | 31138 | 5.3% |
| T | 27792 | 4.8% |
| H | 27719 | 4.7% |
| M | 26116 | 4.5% |
| Other values (16) | 119400 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 10 | |
| 1 | 8 | |
| 4 | 6 | |
| 0 | 4 | 11.8% |
| 8 | 2 | 5.9% |
| 3 | 2 | 5.9% |
| 6 | 2 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 11 | |
| : | 4 | 23.5% |
| . | 2 | 11.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5215271 | |
| Common | 55 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 528684 | 10.1% |
| o | 470494 | 9.0% |
| s | 396333 | 7.6% |
| i | 366131 | 7.0% |
| e | 352590 | 6.8% |
| r | 320159 | 6.1% |
| l | 255087 | 4.9% |
| u | 247647 | 4.7% |
| m | 230840 | 4.4% |
| t | 230398 | 4.4% |
| Other values (44) | 1816908 |
Common
| Value | Count | Frequency (%) |
| ? | 11 | |
| 2 | 10 | |
| 1 | 8 | |
| 4 | 6 | |
| 0 | 4 | 7.3% |
| - | 4 | 7.3% |
| : | 4 | 7.3% |
| 8 | 2 | 3.6% |
| 3 | 2 | 3.6% |
| . | 2 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5215319 | |
| None | 7 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 528684 | 10.1% |
| o | 470494 | 9.0% |
| s | 396333 | 7.6% |
| i | 366131 | 7.0% |
| e | 352590 | 6.8% |
| r | 320159 | 6.1% |
| l | 255087 | 4.9% |
| u | 247647 | 4.7% |
| m | 230840 | 4.4% |
| t | 230398 | 4.4% |
| Other values (53) | 1816956 |
None
| Value | Count | Frequency (%) |
| ö | 6 | |
| ü | 1 | 14.3% |
subgenus
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | true |
|---|---|
| 2nd row | true |
| Value | Count | Frequency (%) |
| true | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 2 | |
| r | 2 | |
| u | 2 | |
| e | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 2 | |
| r | 2 | |
| u | 2 | |
| e | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 2 | |
| r | 2 | |
| u | 2 | |
| e | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 2 | |
| r | 2 | |
| u | 2 | |
| e | 2 |
specificEpithet
Text
Missing 
| Distinct | 74464 |
|---|---|
| Distinct (%) | 15.0% |
| Missing | 109508 |
| Missing (%) | 18.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 19 |
| Mean length | 8.680070205 |
| Min length | 2 |
Unique
| Unique | 40224 ? |
|---|---|
| Unique (%) | 8.1% |
Sample
| 1st row | rufoglaucus |
|---|---|
| 2nd row | mesoleuca |
| 3rd row | asilipennis |
| 4th row | trilobatum |
| 5th row | nanulus |
| Value | Count | Frequency (%) |
| sylvicola | 6282 | 1.3% |
| bifarius | 4077 | 0.8% |
| kirbyellus | 3621 | 0.7% |
| flavifrons | 3474 | 0.7% |
| impatiens | 3132 | 0.6% |
| nevadensis | 2510 | 0.5% |
| cerana | 2431 | 0.5% |
| affinis | 2243 | 0.5% |
| mixtus | 2136 | 0.4% |
| bimaculatus | 2025 | 0.4% |
| Other values (74454) | 463187 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 563643 | |
| i | 497353 | |
| s | 385336 | 9.0% |
| e | 331379 | 7.7% |
| l | 295576 | 6.9% |
| n | 285391 | 6.6% |
| r | 268158 | 6.2% |
| u | 259788 | 6.0% |
| t | 231478 | 5.4% |
| c | 208825 | 4.9% |
| Other values (22) | 970732 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4297456 | |
| Dash Punctuation | 198 | < 0.1% |
| Decimal Number | 4 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 563643 | |
| i | 497353 | |
| s | 385336 | 9.0% |
| e | 331379 | 7.7% |
| l | 295576 | 6.9% |
| n | 285391 | 6.6% |
| r | 268158 | 6.2% |
| u | 259788 | 6.0% |
| t | 231478 | 5.4% |
| c | 208825 | 4.9% |
| Other values (18) | 970529 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 3 | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 198 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4297456 | |
| Common | 203 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 563643 | |
| i | 497353 | |
| s | 385336 | 9.0% |
| e | 331379 | 7.7% |
| l | 295576 | 6.9% |
| n | 285391 | 6.6% |
| r | 268158 | 6.2% |
| u | 259788 | 6.0% |
| t | 231478 | 5.4% |
| c | 208825 | 4.9% |
| Other values (18) | 970529 |
Common
| Value | Count | Frequency (%) |
| - | 198 | |
| 1 | 2 | 1.0% |
| 3 | 2 | 1.0% |
| ' | 1 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4297653 | |
| None | 6 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 563643 | |
| i | 497353 | |
| s | 385336 | 9.0% |
| e | 331379 | 7.7% |
| l | 295576 | 6.9% |
| n | 285391 | 6.6% |
| r | 268158 | 6.2% |
| u | 259788 | 6.0% |
| t | 231478 | 5.4% |
| c | 208825 | 4.9% |
| Other values (20) | 970726 |
None
| Value | Count | Frequency (%) |
| ü | 4 | |
| ö | 2 |
Missing 
| Distinct | 4964 |
|---|---|
| Distinct (%) | 27.2% |
| Missing | 586367 |
| Missing (%) | 97.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 17 |
| Mean length | 8.306752834 |
| Min length | 3 |
Unique
| Unique | 3559 ? |
|---|---|
| Unique (%) | 19.5% |
Sample
| 1st row | rufigenis |
|---|---|
| 2nd row | marianae |
| 3rd row | neglectum |
| 4th row | lavatus |
| 5th row | floridensis |
| Value | Count | Frequency (%) |
| violacea | 979 | 5.4% |
| vagans | 869 | 4.8% |
| portia | 724 | 4.0% |
| auricomus | 587 | 3.2% |
| virginica | 587 | 3.2% |
| dorsata | 437 | 2.4% |
| arizonensis | 431 | 2.4% |
| bantorum | 320 | 1.8% |
| binghami | 303 | 1.7% |
| californica | 291 | 1.6% |
| Other values (4954) | 12731 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 22661 | |
| i | 18593 | |
| s | 12613 | 8.3% |
| n | 10870 | 7.2% |
| r | 10424 | 6.9% |
| e | 9656 | 6.4% |
| o | 9202 | 6.1% |
| c | 7875 | 5.2% |
| u | 7729 | 5.1% |
| l | 7489 | 4.9% |
| Other values (17) | 34561 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 151672 | |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 22661 | |
| i | 18593 | |
| s | 12613 | 8.3% |
| n | 10870 | 7.2% |
| r | 10424 | 6.9% |
| e | 9656 | 6.4% |
| o | 9202 | 6.1% |
| c | 7875 | 5.2% |
| u | 7729 | 5.1% |
| l | 7489 | 4.9% |
| Other values (16) | 34560 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 151672 | |
| Common | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 22661 | |
| i | 18593 | |
| s | 12613 | 8.3% |
| n | 10870 | 7.2% |
| r | 10424 | 6.9% |
| e | 9656 | 6.4% |
| o | 9202 | 6.1% |
| c | 7875 | 5.2% |
| u | 7729 | 5.1% |
| l | 7489 | 4.9% |
| Other values (16) | 34560 |
Common
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 151673 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 22661 | |
| i | 18593 | |
| s | 12613 | 8.3% |
| n | 10870 | 7.2% |
| r | 10424 | 6.9% |
| e | 9656 | 6.4% |
| o | 9202 | 6.1% |
| c | 7875 | 5.2% |
| u | 7729 | 5.1% |
| l | 7489 | 4.9% |
| Other values (17) | 34561 |
cultivarEpithet
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 8.5 |
| Mean length | 8.5 |
| Min length | 4 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | ASIA |
|---|---|
| 2nd row | LATIN_AMERICA |
| Value | Count | Frequency (%) |
| asia | 1 | |
| latin_america | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 5 | |
| I | 3 | |
| S | 1 | 5.9% |
| L | 1 | 5.9% |
| T | 1 | 5.9% |
| N | 1 | 5.9% |
| _ | 1 | 5.9% |
| M | 1 | 5.9% |
| E | 1 | 5.9% |
| R | 1 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 16 | |
| Connector Punctuation | 1 | 5.9% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 5 | |
| I | 3 | |
| S | 1 | 6.2% |
| L | 1 | 6.2% |
| T | 1 | 6.2% |
| N | 1 | 6.2% |
| M | 1 | 6.2% |
| E | 1 | 6.2% |
| R | 1 | 6.2% |
| C | 1 | 6.2% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16 | |
| Common | 1 | 5.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 5 | |
| I | 3 | |
| S | 1 | 6.2% |
| L | 1 | 6.2% |
| T | 1 | 6.2% |
| N | 1 | 6.2% |
| M | 1 | 6.2% |
| E | 1 | 6.2% |
| R | 1 | 6.2% |
| C | 1 | 6.2% |
Common
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 5 | |
| I | 3 | |
| S | 1 | 5.9% |
| L | 1 | 5.9% |
| T | 1 | 5.9% |
| N | 1 | 5.9% |
| _ | 1 | 5.9% |
| M | 1 | 5.9% |
| E | 1 | 5.9% |
| R | 1 | 5.9% |
taxonRank
Text
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 7 |
| Mean length | 6.758805472 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | VARIETY |
|---|---|
| 2nd row | SPECIES |
| 3rd row | SPECIES |
| 4th row | SPECIES |
| 5th row | SPECIES |
| Value | Count | Frequency (%) |
| species | 476863 | |
| genus | 89611 | 14.8% |
| subspecies | 17825 | 2.9% |
| family | 10445 | 1.7% |
| kingdom | 4662 | 0.8% |
| order | 4514 | 0.7% |
| variety | 391 | 0.1% |
| class | 253 | < 0.1% |
| form | 41 | < 0.1% |
| unranked | 11 | < 0.1% |
| Other values (2) | 8 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 1097318 | |
| E | 1083905 | |
| I | 510188 | |
| C | 494943 | |
| P | 494694 | |
| U | 107453 | 2.6% |
| N | 94297 | 2.3% |
| G | 94273 | 2.3% |
| B | 17825 | 0.4% |
| M | 15156 | 0.4% |
| Other values (12) | 76484 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4086534 | |
| Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1097318 | |
| E | 1083905 | |
| I | 510188 | |
| C | 494943 | |
| P | 494694 | |
| U | 107453 | 2.6% |
| N | 94297 | 2.3% |
| G | 94273 | 2.3% |
| B | 17825 | 0.4% |
| M | 15156 | 0.4% |
| Other values (11) | 76482 | 1.9% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4086534 | |
| Common | 2 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 1097318 | |
| E | 1083905 | |
| I | 510188 | |
| C | 494943 | |
| P | 494694 | |
| U | 107453 | 2.6% |
| N | 94297 | 2.3% |
| G | 94273 | 2.3% |
| B | 17825 | 0.4% |
| M | 15156 | 0.4% |
| Other values (11) | 76482 | 1.9% |
Common
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4086536 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 1097318 | |
| E | 1083905 | |
| I | 510188 | |
| C | 494943 | |
| P | 494694 | |
| U | 107453 | 2.6% |
| N | 94297 | 2.3% |
| G | 94273 | 2.3% |
| B | 17825 | 0.4% |
| M | 15156 | 0.4% |
| Other values (12) | 76484 | 1.9% |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604625 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | PER |
|---|
| Value | Count | Frequency (%) |
| per | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 1 | |
| E | 1 | |
| R | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 | |
| E | 1 | |
| R | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| P | 1 | |
| E | 1 | |
| R | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 1 | |
| E | 1 | |
| R | 1 |
vernacularName
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | TYPE |
|---|---|
| 2nd row | Peru |
| Value | Count | Frequency (%) |
| type | 1 | |
| peru | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 2 | |
| T | 1 | |
| Y | 1 | |
| E | 1 | |
| e | 1 | |
| r | 1 | |
| u | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 5 | |
| Lowercase Letter | 3 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 2 | |
| T | 1 | |
| Y | 1 | |
| E | 1 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1 | |
| r | 1 | |
| u | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| P | 2 | |
| T | 1 | |
| Y | 1 | |
| E | 1 | |
| e | 1 | |
| r | 1 | |
| u | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 2 | |
| T | 1 | |
| Y | 1 | |
| E | 1 | |
| e | 1 | |
| r | 1 | |
| u | 1 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604625 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | PER.16_1 |
|---|
| Value | Count | Frequency (%) |
| per.16_1 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2 | |
| P | 1 | |
| E | 1 | |
| R | 1 | |
| . | 1 | |
| 6 | 1 | |
| _ | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3 | |
| Uppercase Letter | 3 | |
| Other Punctuation | 1 | 12.5% |
| Connector Punctuation | 1 | 12.5% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 | |
| E | 1 | |
| R | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 6 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5 | |
| Latin | 3 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2 | |
| . | 1 | |
| 6 | 1 | |
| _ | 1 |
Latin
| Value | Count | Frequency (%) |
| P | 1 | |
| E | 1 | |
| R | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2 | |
| P | 1 | |
| E | 1 | |
| R | 1 | |
| . | 1 | |
| 6 | 1 | |
| _ | 1 |
taxonomicStatus
Text
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4647 |
| Missing (%) | 0.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.880410814 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | ACCEPTED |
|---|---|
| 2nd row | ACCEPTED |
| 3rd row | ACCEPTED |
| 4th row | ACCEPTED |
| 5th row | SYNONYM |
| Value | Count | Frequency (%) |
| accepted | 518943 | |
| synonym | 71747 | 12.0% |
| doubtful | 9288 | 1.5% |
| lima | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 1037886 | |
| C | 1037886 | |
| T | 528231 | |
| D | 528231 | |
| A | 518943 | |
| P | 518943 | |
| N | 143494 | 3.0% |
| Y | 143494 | 3.0% |
| O | 81035 | 1.7% |
| S | 71747 | 1.5% |
| Other values (8) | 118191 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4728078 | |
| Lowercase Letter | 3 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1037886 | |
| C | 1037886 | |
| T | 528231 | |
| D | 528231 | |
| A | 518943 | |
| P | 518943 | |
| N | 143494 | 3.0% |
| Y | 143494 | 3.0% |
| O | 81035 | 1.7% |
| S | 71747 | 1.5% |
| Other values (5) | 118188 | 2.5% |
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1 | |
| m | 1 | |
| a | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4728081 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 1037886 | |
| C | 1037886 | |
| T | 528231 | |
| D | 528231 | |
| A | 518943 | |
| P | 518943 | |
| N | 143494 | 3.0% |
| Y | 143494 | 3.0% |
| O | 81035 | 1.7% |
| S | 71747 | 1.5% |
| Other values (8) | 118191 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4728081 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 1037886 | |
| C | 1037886 | |
| T | 528231 | |
| D | 528231 | |
| A | 518943 | |
| P | 518943 | |
| N | 143494 | 3.0% |
| Y | 143494 | 3.0% |
| O | 81035 | 1.7% |
| S | 71747 | 1.5% |
| Other values (8) | 118191 | 2.5% |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604625 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | PER.16.6_1 |
|---|
| Value | Count | Frequency (%) |
| per.16.6_1 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 2 | |
| 1 | 2 | |
| 6 | 2 | |
| P | 1 | |
| E | 1 | |
| R | 1 | |
| _ | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4 | |
| Uppercase Letter | 3 | |
| Other Punctuation | 2 | |
| Connector Punctuation | 1 | 10.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 | |
| E | 1 | |
| R | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 6 | 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7 | |
| Latin | 3 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 2 | |
| 1 | 2 | |
| 6 | 2 | |
| _ | 1 |
Latin
| Value | Count | Frequency (%) |
| P | 1 | |
| E | 1 | |
| R | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 2 | |
| 1 | 2 | |
| 6 | 2 | |
| P | 1 | |
| E | 1 | |
| R | 1 | |
| _ | 1 |
taxonRemarks
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604625 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Huarochiri |
|---|
| Value | Count | Frequency (%) |
| huarochiri | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 2 | |
| i | 2 | |
| H | 1 | |
| u | 1 | |
| a | 1 | |
| o | 1 | |
| c | 1 | |
| h | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9 | |
| Uppercase Letter | 1 | 10.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2 | |
| i | 2 | |
| u | 1 | |
| a | 1 | |
| o | 1 | |
| c | 1 | |
| h | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 2 | |
| i | 2 | |
| H | 1 | |
| u | 1 | |
| a | 1 | |
| o | 1 | |
| c | 1 | |
| h | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 2 | |
| i | 2 | |
| H | 1 | |
| u | 1 | |
| a | 1 | |
| o | 1 | |
| c | 1 | |
| h | 1 |
datasetKey
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 35.99996196 |
| Min length | 13 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
|---|---|
| 2nd row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 3rd row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 4th row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 5th row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| Value | Count | Frequency (%) |
| 821cc27a-e3bb-4bc5-ac34-89ada245069d | 604622 | |
| per.16.6.16_1 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 2418488 | |
| a | 2418488 | |
| - | 2418488 | |
| 4 | 1813866 | |
| b | 1813866 | |
| 2 | 1813866 | |
| d | 1209244 | 5.6% |
| 9 | 1209244 | 5.6% |
| 5 | 1209244 | 5.6% |
| 8 | 1209244 | 5.6% |
| Other values (11) | 4232367 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10883202 | |
| Lowercase Letter | 8464708 | |
| Dash Punctuation | 2418488 | 11.1% |
| Other Punctuation | 3 | < 0.1% |
| Uppercase Letter | 3 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 1813866 | |
| 2 | 1813866 | |
| 9 | 1209244 | |
| 5 | 1209244 | |
| 8 | 1209244 | |
| 3 | 1209244 | |
| 1 | 604625 | 5.6% |
| 6 | 604625 | 5.6% |
| 7 | 604622 | 5.6% |
| 0 | 604622 | 5.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 2418488 | |
| a | 2418488 | |
| b | 1813866 | |
| d | 1209244 | |
| e | 604622 | 7.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 | |
| E | 1 | |
| R | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2418488 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13301694 | |
| Latin | 8464711 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 2418488 | |
| 4 | 1813866 | |
| 2 | 1813866 | |
| 9 | 1209244 | |
| 5 | 1209244 | |
| 8 | 1209244 | |
| 3 | 1209244 | |
| 1 | 604625 | 4.5% |
| 6 | 604625 | 4.5% |
| 7 | 604622 | 4.5% |
| Other values (3) | 604626 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| c | 2418488 | |
| a | 2418488 | |
| b | 1813866 | |
| d | 1209244 | |
| e | 604622 | 7.1% |
| P | 1 | < 0.1% |
| E | 1 | < 0.1% |
| R | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21766405 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 2418488 | |
| a | 2418488 | |
| - | 2418488 | |
| 4 | 1813866 | |
| b | 1813866 | |
| 2 | 1813866 | |
| d | 1209244 | 5.6% |
| 9 | 1209244 | 5.6% |
| 5 | 1209244 | 5.6% |
| 8 | 1209244 | 5.6% |
| Other values (11) | 4232367 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 2 |
| Mean length | 2.000014885 |
| Min length | 2 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| 5th row | US |
| Value | Count | Frequency (%) |
| us | 604622 | |
| san | 1 | < 0.1% |
| antonio | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 604623 | |
| U | 604622 | |
| n | 3 | < 0.1% |
| o | 2 | < 0.1% |
| a | 1 | < 0.1% |
| 1 | < 0.1% | |
| A | 1 | < 0.1% |
| t | 1 | < 0.1% |
| i | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1209246 | |
| Lowercase Letter | 8 | < 0.1% |
| Space Separator | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 3 | |
| o | 2 | |
| a | 1 | 12.5% |
| t | 1 | 12.5% |
| i | 1 | 12.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 604623 | |
| U | 604622 | |
| A | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1209254 | |
| Common | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 604623 | |
| U | 604622 | |
| n | 3 | < 0.1% |
| o | 2 | < 0.1% |
| a | 1 | < 0.1% |
| A | 1 | < 0.1% |
| t | 1 | < 0.1% |
| i | 1 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1209255 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 604623 | |
| U | 604622 | |
| n | 3 | < 0.1% |
| o | 2 | < 0.1% |
| a | 1 | < 0.1% |
| 1 | < 0.1% | |
| A | 1 | < 0.1% |
| t | 1 | < 0.1% |
| i | 1 | < 0.1% |
lastInterpreted
Text
| Distinct | 186893 |
|---|---|
| Distinct (%) | 30.9% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.9957792 |
| Min length | 2 |
Unique
| Unique | 38990 ? |
|---|---|
| Unique (%) | 6.4% |
Sample
| 1st row | 2024-12-02T13:57:44.315Z |
|---|---|
| 2nd row | 2024-12-02T13:57:18.321Z |
| 3rd row | 2024-12-02T13:59:05.381Z |
| 4th row | 2024-12-02T13:57:22.450Z |
| 5th row | 2024-12-02T13:57:21.275Z |
| Value | Count | Frequency (%) |
| 2024-12-02t13:57:45.539z | 16 | < 0.1% |
| 2024-12-02t13:57:59.931z | 16 | < 0.1% |
| 2024-12-02t13:57:53.908z | 16 | < 0.1% |
| 2024-12-02t13:57:26.378z | 16 | < 0.1% |
| 2024-12-02t13:57:29.420z | 15 | < 0.1% |
| 2024-12-02t13:56:43.735z | 15 | < 0.1% |
| 2024-12-02t13:57:51.108z | 15 | < 0.1% |
| 2024-12-02t13:58:53.448z | 15 | < 0.1% |
| 2024-12-02t13:56:41.760z | 15 | < 0.1% |
| 2024-12-02t13:57:19.226z | 15 | < 0.1% |
| Other values (186883) | 604470 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2760432 | |
| 0 | 1532584 | |
| 1 | 1525143 | |
| : | 1209244 | |
| - | 1209244 | |
| 4 | 972748 | 6.7% |
| 5 | 960823 | 6.6% |
| 3 | 957684 | 6.6% |
| T | 604622 | 4.2% |
| Z | 604622 | 4.2% |
| Other values (7) | 2171278 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10276693 | |
| Other Punctuation | 1813239 | 12.5% |
| Uppercase Letter | 1209248 | 8.3% |
| Dash Punctuation | 1209244 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2760432 | |
| 0 | 1532584 | |
| 1 | 1525143 | |
| 4 | 972748 | 9.5% |
| 5 | 960823 | 9.3% |
| 3 | 957684 | 9.3% |
| 7 | 464169 | 4.5% |
| 9 | 387034 | 3.8% |
| 6 | 364187 | 3.5% |
| 8 | 351889 | 3.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 604622 | |
| Z | 604622 | |
| L | 2 | < 0.1% |
| C | 2 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1209244 | |
| . | 603995 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1209244 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13299176 | |
| Latin | 1209248 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2760432 | |
| 0 | 1532584 | |
| 1 | 1525143 | |
| : | 1209244 | |
| - | 1209244 | |
| 4 | 972748 | 7.3% |
| 5 | 960823 | 7.2% |
| 3 | 957684 | 7.2% |
| . | 603995 | 4.5% |
| 7 | 464169 | 3.5% |
| Other values (3) | 1103110 | 8.3% |
Latin
| Value | Count | Frequency (%) |
| T | 604622 | |
| Z | 604622 | |
| L | 2 | < 0.1% |
| C | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14508424 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2760432 | |
| 0 | 1532584 | |
| 1 | 1525143 | |
| : | 1209244 | |
| - | 1209244 | |
| 4 | 972748 | 6.7% |
| 5 | 960823 | 6.6% |
| 3 | 957684 | 6.6% |
| T | 604622 | 4.2% |
| Z | 604622 | 4.2% |
| Other values (7) | 2171278 |
elevation
Text
Missing 
| Distinct | 1990 |
|---|---|
| Distinct (%) | 4.3% |
| Missing | 557870 |
| Missing (%) | 92.3% |
| Memory size | 4.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 5.390794764 |
| Min length | 3 |
Unique
| Unique | 527 ? |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | 2040.0 |
|---|---|
| 2nd row | 240.0 |
| 3rd row | 165.0 |
| 4th row | 400.0 |
| 5th row | 1300.0 |
| Value | Count | Frequency (%) |
| 2743.0 | 1163 | 2.5% |
| 3353.0 | 875 | 1.9% |
| 1524.0 | 704 | 1.5% |
| 1829.0 | 659 | 1.4% |
| 1100.0 | 556 | 1.2% |
| 914.0 | 524 | 1.1% |
| 427.0 | 524 | 1.1% |
| 250.0 | 506 | 1.1% |
| 200.0 | 496 | 1.1% |
| 1372.0 | 495 | 1.1% |
| Other values (1976) | 40254 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 73766 | |
| . | 46756 | |
| 1 | 25753 | 10.2% |
| 2 | 21657 | 8.6% |
| 5 | 16361 | 6.5% |
| 3 | 15333 | 6.1% |
| 4 | 13857 | 5.5% |
| 7 | 11572 | 4.6% |
| 6 | 9583 | 3.8% |
| 9 | 9338 | 3.7% |
| Other values (2) | 8076 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 205275 | |
| Other Punctuation | 46756 | 18.6% |
| Dash Punctuation | 21 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 73766 | |
| 1 | 25753 | 12.5% |
| 2 | 21657 | 10.6% |
| 5 | 16361 | 8.0% |
| 3 | 15333 | 7.5% |
| 4 | 13857 | 6.8% |
| 7 | 11572 | 5.6% |
| 6 | 9583 | 4.7% |
| 9 | 9338 | 4.5% |
| 8 | 8055 | 3.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 46756 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 21 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 252052 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 73766 | |
| . | 46756 | |
| 1 | 25753 | 10.2% |
| 2 | 21657 | 8.6% |
| 5 | 16361 | 6.5% |
| 3 | 15333 | 6.1% |
| 4 | 13857 | 5.5% |
| 7 | 11572 | 4.6% |
| 6 | 9583 | 3.8% |
| 9 | 9338 | 3.7% |
| Other values (2) | 8076 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 252052 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 73766 | |
| . | 46756 | |
| 1 | 25753 | 10.2% |
| 2 | 21657 | 8.6% |
| 5 | 16361 | 6.5% |
| 3 | 15333 | 6.1% |
| 4 | 13857 | 5.5% |
| 7 | 11572 | 4.6% |
| 6 | 9583 | 3.8% |
| 9 | 9338 | 3.7% |
| Other values (2) | 8076 | 3.2% |
Missing 
| Distinct | 215 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 573282 |
| Missing (%) | 94.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 3 |
| Mean length | 3.207886677 |
| Min length | 3 |
Unique
| Unique | 90 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
| Value | Count | Frequency (%) |
| 0.0 | 27237 | |
| 152.5 | 408 | 1.3% |
| 30.5 | 325 | 1.0% |
| 457.0 | 257 | 0.8% |
| 100.0 | 249 | 0.8% |
| 15.0 | 217 | 0.7% |
| 914.0 | 185 | 0.6% |
| 50.0 | 181 | 0.6% |
| 305.0 | 175 | 0.6% |
| 25.0 | 147 | 0.5% |
| Other values (205) | 1963 | 6.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 58739 | |
| . | 31342 | |
| 5 | 3647 | 3.6% |
| 1 | 1753 | 1.7% |
| 2 | 1231 | 1.2% |
| 3 | 993 | 1.0% |
| 7 | 914 | 0.9% |
| 4 | 802 | 0.8% |
| 6 | 537 | 0.5% |
| 9 | 371 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 69206 | |
| Other Punctuation | 31342 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 58739 | |
| 5 | 3647 | 5.3% |
| 1 | 1753 | 2.5% |
| 2 | 1231 | 1.8% |
| 3 | 993 | 1.4% |
| 7 | 914 | 1.3% |
| 4 | 802 | 1.2% |
| 6 | 537 | 0.8% |
| 9 | 371 | 0.5% |
| 8 | 219 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 31342 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 100548 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 58739 | |
| . | 31342 | |
| 5 | 3647 | 3.6% |
| 1 | 1753 | 1.7% |
| 2 | 1231 | 1.2% |
| 3 | 993 | 1.0% |
| 7 | 914 | 0.9% |
| 4 | 802 | 0.8% |
| 6 | 537 | 0.5% |
| 9 | 371 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 100548 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 58739 | |
| . | 31342 | |
| 5 | 3647 | 3.6% |
| 1 | 1753 | 1.7% |
| 2 | 1231 | 1.2% |
| 3 | 993 | 1.0% |
| 7 | 914 | 0.9% |
| 4 | 802 | 0.8% |
| 6 | 537 | 0.5% |
| 9 | 371 | 0.4% |
depth
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | 35.3% |
| Missing | 604592 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.147058824 |
| Min length | 5 |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | 17.6% |
Sample
| 1st row | 110.0 |
|---|---|
| 2nd row | 250.0 |
| 3rd row | 110.0 |
| 4th row | 370.0 |
| 5th row | 359.0 |
| Value | Count | Frequency (%) |
| 250.0 | 9 | |
| 110.0 | 6 | |
| 880.0 | 6 | |
| 370.0 | 3 | 8.8% |
| 1707.0 | 2 | 5.9% |
| 775.0 | 2 | 5.9% |
| 359.0 | 1 | 2.9% |
| 1400.0 | 1 | 2.9% |
| 1743.0 | 1 | 2.9% |
| 500.0 | 1 | 2.9% |
| Other values (2) | 2 | 5.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 68 | |
| . | 34 | |
| 1 | 16 | 9.1% |
| 5 | 13 | 7.4% |
| 7 | 13 | 7.4% |
| 8 | 12 | 6.9% |
| 2 | 9 | 5.1% |
| 3 | 6 | 3.4% |
| 4 | 2 | 1.1% |
| 9 | 1 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 141 | |
| Other Punctuation | 34 | 19.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 68 | |
| 1 | 16 | 11.3% |
| 5 | 13 | 9.2% |
| 7 | 13 | 9.2% |
| 8 | 12 | 8.5% |
| 2 | 9 | 6.4% |
| 3 | 6 | 4.3% |
| 4 | 2 | 1.4% |
| 9 | 1 | 0.7% |
| 6 | 1 | 0.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 34 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 175 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 68 | |
| . | 34 | |
| 1 | 16 | 9.1% |
| 5 | 13 | 7.4% |
| 7 | 13 | 7.4% |
| 8 | 12 | 6.9% |
| 2 | 9 | 5.1% |
| 3 | 6 | 3.4% |
| 4 | 2 | 1.1% |
| 9 | 1 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 175 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 68 | |
| . | 34 | |
| 1 | 16 | 9.1% |
| 5 | 13 | 7.4% |
| 7 | 13 | 7.4% |
| 8 | 12 | 6.9% |
| 2 | 9 | 5.1% |
| 3 | 6 | 3.4% |
| 4 | 2 | 1.1% |
| 9 | 1 | 0.6% |
depthAccuracy
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 18.2% |
| Missing | 604615 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.090909091 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 110.0 |
|---|---|
| 2nd row | 110.0 |
| 3rd row | 0.0 |
| 4th row | 110.0 |
| 5th row | 0.0 |
| Value | Count | Frequency (%) |
| 110.0 | 6 | |
| 0.0 | 5 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 22 | |
| 1 | 12 | |
| . | 11 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 34 | |
| Other Punctuation | 11 | 24.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 22 | |
| 1 | 12 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 11 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 45 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 22 | |
| 1 | 12 | |
| . | 11 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 45 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 22 | |
| 1 | 12 | |
| . | 11 |
distanceFromCentroidInMeters
Text
Missing 
| Distinct | 259 |
|---|---|
| Distinct (%) | 8.6% |
| Missing | 601631 |
| Missing (%) | 99.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 14.27512521 |
| Min length | 3 |
Unique
| Unique | 107 ? |
|---|---|
| Unique (%) | 3.6% |
Sample
| 1st row | 4105.643932903784 |
|---|---|
| 2nd row | 4067.9280715056975 |
| 3rd row | 3039.0244431707993 |
| 4th row | 0.0 |
| 5th row | 2839.7634303533896 |
| Value | Count | Frequency (%) |
| 0.0 | 634 | |
| 4105.643932903784 | 593 | |
| 949.7490617483568 | 164 | 5.5% |
| 513.8699121355281 | 112 | 3.7% |
| 4282.192003849806 | 80 | 2.7% |
| 347.46362945305606 | 75 | 2.5% |
| 1404.2075323592617 | 56 | 1.9% |
| 512.1584099513866 | 45 | 1.5% |
| 247.47802974000376 | 40 | 1.3% |
| 3590.2355648532216 | 39 | 1.3% |
| Other values (249) | 1157 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5141 | |
| 4 | 5070 | |
| 3 | 4873 | |
| 9 | 4010 | |
| 1 | 3629 | |
| 2 | 3557 | |
| 6 | 3536 | |
| 5 | 3528 | |
| 8 | 3361 | |
| 7 | 3054 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 39759 | |
| Other Punctuation | 2995 | 7.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5141 | |
| 4 | 5070 | |
| 3 | 4873 | |
| 9 | 4010 | |
| 1 | 3629 | |
| 2 | 3557 | |
| 6 | 3536 | |
| 5 | 3528 | |
| 8 | 3361 | |
| 7 | 3054 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2995 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 42754 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 5141 | |
| 4 | 5070 | |
| 3 | 4873 | |
| 9 | 4010 | |
| 1 | 3629 | |
| 2 | 3557 | |
| 6 | 3536 | |
| 5 | 3528 | |
| 8 | 3361 | |
| 7 | 3054 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42754 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 5141 | |
| 4 | 5070 | |
| 3 | 4873 | |
| 9 | 4010 | |
| 1 | 3629 | |
| 2 | 3557 | |
| 6 | 3536 | |
| 5 | 3528 | |
| 8 | 3361 | |
| 7 | 3054 |
issue
Text
| Distinct | 143 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2735 |
| Missing (%) | 0.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 200 |
|---|---|
| Median length | 198 |
| Mean length | 91.49480222 |
| Min length | 15 |
Unique
| Unique | 30 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
|---|---|
| 2nd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| 3rd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| 4th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84;CONTINENT_DERIVED_FROM_COORDINATES |
| 5th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84;CONTINENT_DERIVED_FROM_COORDINATES |
| Value | Count | Frequency (%) |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;continent_derived_from_coordinates | 248827 | |
| occurrence_status_inferred_from_individual_count | 146633 | |
| occurrence_status_inferred_from_individual_count;continent_derived_from_country | 70820 | 11.8% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;continent_derived_from_coordinates;taxon_match_higherrank | 32084 | 5.3% |
| occurrence_status_inferred_from_individual_count;taxon_match_higherrank | 31693 | 5.3% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;geodetic_datum_invalid;continent_derived_from_coordinates | 21011 | 3.5% |
| occurrence_status_inferred_from_individual_count;continent_derived_from_country;taxon_match_higherrank | 11828 | 2.0% |
| occurrence_status_inferred_from_individual_count;taxon_match_fuzzy | 6987 | 1.2% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;continent_derived_from_coordinates;taxon_match_fuzzy | 6167 | 1.0% |
| occurrence_status_inferred_from_individual_count;country_invalid | 5364 | 0.9% |
| Other values (133) | 20477 | 3.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| _ | 5457778 | |
| E | 5050863 | 9.2% |
| R | 4409788 | 8.0% |
| N | 4266550 | 7.7% |
| I | 4035031 | 7.3% |
| D | 3989481 | 7.2% |
| T | 3935075 | 7.1% |
| O | 3811449 | 6.9% |
| C | 3682307 | 6.7% |
| U | 3181760 | 5.8% |
| Other values (18) | 13249816 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 48110489 | |
| Connector Punctuation | 5457778 | 9.9% |
| Other Punctuation | 865049 | 1.6% |
| Decimal Number | 636582 | 1.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 5050863 | |
| R | 4409788 | |
| N | 4266550 | |
| I | 4035031 | |
| D | 3989481 | |
| T | 3935075 | |
| O | 3811449 | |
| C | 3682307 | 7.7% |
| U | 3181760 | 6.6% |
| A | 2514457 | 5.2% |
| Other values (14) | 9233728 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 318291 | |
| 4 | 318291 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 5457778 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 865049 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 48110489 | |
| Common | 6959409 | 12.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 5050863 | |
| R | 4409788 | |
| N | 4266550 | |
| I | 4035031 | |
| D | 3989481 | |
| T | 3935075 | |
| O | 3811449 | |
| C | 3682307 | 7.7% |
| U | 3181760 | 6.6% |
| A | 2514457 | 5.2% |
| Other values (14) | 9233728 |
Common
| Value | Count | Frequency (%) |
| _ | 5457778 | |
| ; | 865049 | 12.4% |
| 8 | 318291 | 4.6% |
| 4 | 318291 | 4.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 55069898 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| _ | 5457778 | |
| E | 5050863 | 9.2% |
| R | 4409788 | 8.0% |
| N | 4266550 | 7.7% |
| I | 4035031 | 7.3% |
| D | 3989481 | 7.2% |
| T | 3935075 | 7.1% |
| O | 3811449 | 6.9% |
| C | 3682307 | 6.7% |
| U | 3181760 | 5.8% |
| Other values (18) | 13249816 |
mediaType
Text
Missing 
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 369838 |
| Missing (%) | 61.2% |
| Memory size | 4.6 MiB |
Length
| Max length | 241 |
|---|---|
| Median length | 10 |
| Mean length | 15.46893794 |
| Min length | 10 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | StillImage |
|---|---|
| 2nd row | StillImage |
| 3rd row | StillImage |
| 4th row | StillImage |
| 5th row | StillImage |
| Value | Count | Frequency (%) |
| stillimage | 192418 | |
| stillimage;stillimage;stillimage;stillimage | 14664 | 6.2% |
| stillimage;stillimage;stillimage | 10110 | 4.3% |
| stillimage;stillimage | 8407 | 3.6% |
| stillimage;stillimage;stillimage;stillimage;stillimage | 5480 | 2.3% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 1551 | 0.7% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 1253 | 0.5% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 626 | 0.3% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 139 | 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 71 | < 0.1% |
| Other values (9) | 69 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 703038 | |
| S | 351519 | |
| t | 351519 | |
| i | 351519 | |
| I | 351519 | |
| m | 351519 | |
| a | 351519 | |
| g | 351519 | |
| e | 351519 | |
| ; | 116731 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2812152 | |
| Uppercase Letter | 703038 | 19.4% |
| Other Punctuation | 116731 | 3.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 703038 | |
| t | 351519 | |
| i | 351519 | |
| m | 351519 | |
| a | 351519 | |
| g | 351519 | |
| e | 351519 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 351519 | |
| I | 351519 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 116731 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3515190 | |
| Common | 116731 | 3.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 703038 | |
| S | 351519 | |
| t | 351519 | |
| i | 351519 | |
| I | 351519 | |
| m | 351519 | |
| a | 351519 | |
| g | 351519 | |
| e | 351519 |
Common
| Value | Count | Frequency (%) |
| ; | 116731 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3631921 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 703038 | |
| S | 351519 | |
| t | 351519 | |
| i | 351519 | |
| I | 351519 | |
| m | 351519 | |
| a | 351519 | |
| g | 351519 | |
| e | 351519 | |
| ; | 116731 | 3.2% |
hasCoordinate
Text
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 4 |
| Mean length | 4.472394414 |
| Min length | 4 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | true |
| 5th row | true |
| Value | Count | Frequency (%) |
| true | 319051 | |
| false | 285571 | |
| trogoderma | 1 | < 0.1% |
| dejean | 1 | < 0.1% |
| 1821 | 1 | < 0.1% |
| aphytis | 1 | < 0.1% |
| roseni | 1 | < 0.1% |
| debach | 1 | < 0.1% |
| 1 | < 0.1% | |
| gordh | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 604627 | |
| r | 319055 | |
| t | 319052 | |
| u | 319051 | |
| a | 285574 | |
| s | 285573 | |
| f | 285571 | |
| l | 285571 | |
| 7 | < 0.1% | |
| o | 4 | < 0.1% |
| Other values (23) | 32 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2704093 | |
| Decimal Number | 8 | < 0.1% |
| Space Separator | 7 | < 0.1% |
| Uppercase Letter | 6 | < 0.1% |
| Other Punctuation | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 604627 | |
| r | 319055 | |
| t | 319052 | |
| u | 319051 | |
| a | 285574 | |
| s | 285573 | |
| f | 285571 | |
| l | 285571 | |
| o | 4 | < 0.1% |
| h | 3 | < 0.1% |
| Other values (9) | 12 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3 | |
| 7 | 1 | 12.5% |
| 9 | 1 | 12.5% |
| 2 | 1 | 12.5% |
| 8 | 1 | 12.5% |
| 4 | 1 | 12.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 2 | |
| T | 1 | |
| G | 1 | |
| B | 1 | |
| A | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2 | |
| & | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2704099 | |
| Common | 18 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 604627 | |
| r | 319055 | |
| t | 319052 | |
| u | 319051 | |
| a | 285574 | |
| s | 285573 | |
| f | 285571 | |
| l | 285571 | |
| o | 4 | < 0.1% |
| h | 3 | < 0.1% |
| Other values (14) | 18 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 7 | ||
| 1 | 3 | |
| , | 2 | 11.1% |
| 7 | 1 | 5.6% |
| 9 | 1 | 5.6% |
| & | 1 | 5.6% |
| 2 | 1 | 5.6% |
| 8 | 1 | 5.6% |
| 4 | 1 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2704117 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 604627 | |
| r | 319055 | |
| t | 319052 | |
| u | 319051 | |
| a | 285574 | |
| s | 285573 | |
| f | 285571 | |
| l | 285571 | |
| 7 | < 0.1% | |
| o | 4 | < 0.1% |
| Other values (23) | 32 | < 0.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.99879098 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 603891 | |
| true | 731 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 604622 | |
| f | 603891 | |
| a | 603891 | |
| l | 603891 | |
| s | 603891 | |
| t | 731 | < 0.1% |
| r | 731 | < 0.1% |
| u | 731 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3022379 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 604622 | |
| f | 603891 | |
| a | 603891 | |
| l | 603891 | |
| s | 603891 | |
| t | 731 | < 0.1% |
| r | 731 | < 0.1% |
| u | 731 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3022379 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 604622 | |
| f | 603891 | |
| a | 603891 | |
| l | 603891 | |
| s | 603891 | |
| t | 731 | < 0.1% |
| r | 731 | < 0.1% |
| u | 731 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3022379 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 604622 | |
| f | 603891 | |
| a | 603891 | |
| l | 603891 | |
| s | 603891 | |
| t | 731 | < 0.1% |
| r | 731 | < 0.1% |
| u | 731 | < 0.1% |
taxonKey
Text
| Distinct | 203336 |
|---|---|
| Distinct (%) | 33.6% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.907198216 |
| Min length | 1 |
Unique
| Unique | 154756 ? |
|---|---|
| Unique (%) | 25.6% |
Sample
| 1st row | 7866975 |
|---|---|
| 2nd row | 5122189 |
| 3rd row | 1939887 |
| 4th row | 1422444 |
| 5th row | 7820915 |
| Value | Count | Frequency (%) |
| 1340278 | 10672 | 1.8% |
| 1340525 | 6264 | 1.0% |
| 0 | 4644 | 0.8% |
| 1340393 | 4071 | 0.7% |
| 10976534 | 3621 | 0.6% |
| 789 | 3466 | 0.6% |
| 1340467 | 3340 | 0.6% |
| 9164 | 3176 | 0.5% |
| 1340350 | 3129 | 0.5% |
| 1341979 | 2431 | 0.4% |
| Other values (203326) | 559808 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 694706 | |
| 4 | 514772 | |
| 0 | 431191 | |
| 2 | 416968 | |
| 3 | 414819 | |
| 5 | 390530 | |
| 9 | 337522 | |
| 8 | 337016 | |
| 7 | 334447 | |
| 6 | 304273 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4176244 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 694706 | |
| 4 | 514772 | |
| 0 | 431191 | |
| 2 | 416968 | |
| 3 | 414819 | |
| 5 | 390530 | |
| 9 | 337522 | |
| 8 | 337016 | |
| 7 | 334447 | |
| 6 | 304273 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4176244 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 694706 | |
| 4 | 514772 | |
| 0 | 431191 | |
| 2 | 416968 | |
| 3 | 414819 | |
| 5 | 390530 | |
| 9 | 337522 | |
| 8 | 337016 | |
| 7 | 334447 | |
| 6 | 304273 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4176244 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 694706 | |
| 4 | 514772 | |
| 0 | 431191 | |
| 2 | 416968 | |
| 3 | 414819 | |
| 5 | 390530 | |
| 9 | 337522 | |
| 8 | 337016 | |
| 7 | 334447 | |
| 6 | 304273 |
acceptedTaxonKey
Text
| Distinct | 188378 |
|---|---|
| Distinct (%) | 31.4% |
| Missing | 4648 |
| Missing (%) | 0.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.955165023 |
| Min length | 1 |
Unique
| Unique | 134600 ? |
|---|---|
| Unique (%) | 22.4% |
Sample
| 1st row | 7866975 |
|---|---|
| 2nd row | 5122189 |
| 3rd row | 1939887 |
| 4th row | 1422444 |
| 5th row | 4988370 |
| Value | Count | Frequency (%) |
| 1340278 | 10672 | 1.8% |
| 1340525 | 6265 | 1.0% |
| 1340393 | 4073 | 0.7% |
| 10409744 | 3623 | 0.6% |
| 789 | 3466 | 0.6% |
| 1340467 | 3343 | 0.6% |
| 9164 | 3176 | 0.5% |
| 1340350 | 3129 | 0.5% |
| 1341979 | 2431 | 0.4% |
| 1340485 | 2119 | 0.4% |
| Other values (188368) | 557681 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 709890 | |
| 4 | 525521 | |
| 0 | 431132 | |
| 2 | 418685 | |
| 3 | 411620 | |
| 5 | 382598 | |
| 8 | 332590 | |
| 7 | 330905 | |
| 9 | 329832 | |
| 6 | 300173 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4172946 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 709890 | |
| 4 | 525521 | |
| 0 | 431132 | |
| 2 | 418685 | |
| 3 | 411620 | |
| 5 | 382598 | |
| 8 | 332590 | |
| 7 | 330905 | |
| 9 | 329832 | |
| 6 | 300173 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4172946 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 709890 | |
| 4 | 525521 | |
| 0 | 431132 | |
| 2 | 418685 | |
| 3 | 411620 | |
| 5 | 382598 | |
| 8 | 332590 | |
| 7 | 330905 | |
| 9 | 329832 | |
| 6 | 300173 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4172946 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 709890 | |
| 4 | 525521 | |
| 0 | 431132 | |
| 2 | 418685 | |
| 3 | 411620 | |
| 5 | 382598 | |
| 8 | 332590 | |
| 7 | 330905 | |
| 9 | 329832 | |
| 6 | 300173 |
kingdomKey
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 599978 | |
| 0 | 4644 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 599978 | |
| 0 | 4644 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 604622 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 599978 | |
| 0 | 4644 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 604622 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 599978 | |
| 0 | 4644 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 604622 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 599978 | |
| 0 | 4644 | 0.8% |
phylumKey
Text
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5247 |
| Missing (%) | 0.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 54 |
|---|---|
| 2nd row | 54 |
| 3rd row | 54 |
| 4th row | 54 |
| 5th row | 54 |
| Value | Count | Frequency (%) |
| 54 | 599346 | |
| 43 | 18 | < 0.1% |
| 62 | 6 | < 0.1% |
| 52 | 5 | < 0.1% |
| 44 | 2 | < 0.1% |
| 63 | 1 | < 0.1% |
| 50 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 599368 | |
| 5 | 599352 | |
| 3 | 19 | < 0.1% |
| 2 | 11 | < 0.1% |
| 6 | 7 | < 0.1% |
| 0 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1198758 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 599368 | |
| 5 | 599352 | |
| 3 | 19 | < 0.1% |
| 2 | 11 | < 0.1% |
| 6 | 7 | < 0.1% |
| 0 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1198758 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 599368 | |
| 5 | 599352 | |
| 3 | 19 | < 0.1% |
| 2 | 11 | < 0.1% |
| 6 | 7 | < 0.1% |
| 0 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1198758 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 599368 | |
| 5 | 599352 | |
| 3 | 19 | < 0.1% |
| 2 | 11 | < 0.1% |
| 6 | 7 | < 0.1% |
| 0 | 1 | < 0.1% |
classKey
Text
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5283 |
| Missing (%) | 0.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 3 |
| Mean length | 3.008053819 |
| Min length | 3 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 216 |
|---|---|
| 2nd row | 216 |
| 3rd row | 216 |
| 4th row | 216 |
| 5th row | 216 |
| Value | Count | Frequency (%) |
| 216 | 588111 | |
| 367 | 7917 | 1.3% |
| 361 | 1599 | 0.3% |
| 10713444 | 820 | 0.1% |
| 360 | 736 | 0.1% |
| 11374670 | 77 | < 0.1% |
| 11377931 | 62 | < 0.1% |
| 7742773 | 8 | < 0.1% |
| 229 | 5 | < 0.1% |
| 143 | 4 | < 0.1% |
| Other values (3) | 4 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 598440 | |
| 1 | 591697 | |
| 2 | 588136 | |
| 3 | 11285 | 0.6% |
| 7 | 9047 | 0.5% |
| 4 | 2549 | 0.1% |
| 0 | 1633 | 0.1% |
| 9 | 67 | < 0.1% |
| 5 | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1802856 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 598440 | |
| 1 | 591697 | |
| 2 | 588136 | |
| 3 | 11285 | 0.6% |
| 7 | 9047 | 0.5% |
| 4 | 2549 | 0.1% |
| 0 | 1633 | 0.1% |
| 9 | 67 | < 0.1% |
| 5 | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1802856 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 598440 | |
| 1 | 591697 | |
| 2 | 588136 | |
| 3 | 11285 | 0.6% |
| 7 | 9047 | 0.5% |
| 4 | 2549 | 0.1% |
| 0 | 1633 | 0.1% |
| 9 | 67 | < 0.1% |
| 5 | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1802856 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 598440 | |
| 1 | 591697 | |
| 2 | 588136 | |
| 3 | 11285 | 0.6% |
| 7 | 9047 | 0.5% |
| 4 | 2549 | 0.1% |
| 0 | 1633 | 0.1% |
| 9 | 67 | < 0.1% |
| 5 | 2 | < 0.1% |
orderKey
Text
| Distinct | 74 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5577 |
| Missing (%) | 0.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 55 |
|---|---|
| Median length | 3 |
| Mean length | 3.462212607 |
| Min length | 3 |
Unique
| Unique | 8 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1457 |
|---|---|
| 2nd row | 797 |
| 3rd row | 797 |
| 4th row | 789 |
| 5th row | 1470 |
| Value | Count | Frequency (%) |
| 1457 | 146330 | |
| 789 | 117284 | |
| 797 | 99491 | |
| 811 | 73566 | |
| 1470 | 71961 | |
| 809 | 37757 | 6.3% |
| 1366 | 10087 | 1.7% |
| 1003 | 9104 | 1.5% |
| 1228 | 4628 | 0.8% |
| 1496 | 4624 | 0.8% |
| Other values (69) | 24225 | 4.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 545010 | |
| 1 | 417485 | |
| 9 | 260222 | |
| 8 | 248345 | |
| 4 | 231789 | |
| 5 | 157191 | 7.6% |
| 0 | 140519 | 6.8% |
| 6 | 30519 | 1.5% |
| 3 | 24309 | 1.2% |
| 2 | 18537 | 0.9% |
| Other values (23) | 109 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2073926 | |
| Lowercase Letter | 83 | < 0.1% |
| Uppercase Letter | 10 | < 0.1% |
| Space Separator | 8 | < 0.1% |
| Other Punctuation | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 12 | |
| e | 10 | |
| i | 7 | |
| r | 7 | |
| t | 7 | |
| o | 7 | |
| n | 6 | |
| p | 6 | |
| d | 4 | 4.8% |
| l | 4 | 4.8% |
| Other values (6) | 13 |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 545010 | |
| 1 | 417485 | |
| 9 | 260222 | |
| 8 | 248345 | |
| 4 | 231789 | |
| 5 | 157191 | 7.6% |
| 0 | 140519 | 6.8% |
| 6 | 30519 | 1.5% |
| 3 | 24309 | 1.2% |
| 2 | 18537 | 0.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 5 | |
| I | 2 | 20.0% |
| C | 1 | 10.0% |
| B | 1 | 10.0% |
| H | 1 | 10.0% |
Space Separator
| Value | Count | Frequency (%) |
| 8 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2073942 | |
| Latin | 93 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 12 | |
| e | 10 | |
| i | 7 | 7.5% |
| r | 7 | 7.5% |
| t | 7 | 7.5% |
| o | 7 | 7.5% |
| n | 6 | 6.5% |
| p | 6 | 6.5% |
| A | 5 | 5.4% |
| d | 4 | 4.3% |
| Other values (11) | 22 |
Common
| Value | Count | Frequency (%) |
| 7 | 545010 | |
| 1 | 417485 | |
| 9 | 260222 | |
| 8 | 248345 | |
| 4 | 231789 | |
| 5 | 157191 | 7.6% |
| 0 | 140519 | 6.8% |
| 6 | 30519 | 1.5% |
| 3 | 24309 | 1.2% |
| 2 | 18537 | 0.9% |
| Other values (2) | 16 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2074035 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | 545010 | |
| 1 | 417485 | |
| 9 | 260222 | |
| 8 | 248345 | |
| 4 | 231789 | |
| 5 | 157191 | 7.6% |
| 0 | 140519 | 6.8% |
| 6 | 30519 | 1.5% |
| 3 | 24309 | 1.2% |
| 2 | 18537 | 0.9% |
| Other values (23) | 109 | < 0.1% |
familyKey
Text
Missing 
| Distinct | 1493 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 11642 |
| Missing (%) | 1.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 4.092565735 |
| Min length | 4 |
Unique
| Unique | 194 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 4342 |
|---|---|
| 2nd row | 3553 |
| 3rd row | 5340 |
| 4th row | 8577 |
| 5th row | 3792 |
| Value | Count | Frequency (%) |
| 4334 | 82646 | 13.9% |
| 5936 | 42503 | 7.2% |
| 8577 | 36255 | 6.1% |
| 7780 | 17448 | 2.9% |
| 8841 | 13614 | 2.3% |
| 7275 | 13374 | 2.3% |
| 6950 | 12793 | 2.2% |
| 9164 | 11788 | 2.0% |
| 4239 | 11689 | 2.0% |
| 4342 | 9878 | 1.7% |
| Other values (1483) | 340996 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 429068 | |
| 3 | 397892 | |
| 7 | 294264 | |
| 5 | 278623 | |
| 9 | 230925 | |
| 8 | 216518 | |
| 6 | 162474 | 6.7% |
| 0 | 149228 | 6.1% |
| 2 | 136060 | 5.6% |
| 1 | 131758 | 5.4% |
| Other values (6) | 16 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2426810 | |
| Lowercase Letter | 14 | < 0.1% |
| Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 429068 | |
| 3 | 397892 | |
| 7 | 294264 | |
| 5 | 278623 | |
| 9 | 230925 | |
| 8 | 216518 | |
| 6 | 162474 | 6.7% |
| 0 | 149228 | 6.1% |
| 2 | 136060 | 5.6% |
| 1 | 131758 | 5.4% |
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 4 | |
| a | 4 | |
| n | 2 | |
| m | 2 | |
| l | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2426810 | |
| Latin | 16 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 429068 | |
| 3 | 397892 | |
| 7 | 294264 | |
| 5 | 278623 | |
| 9 | 230925 | |
| 8 | 216518 | |
| 6 | 162474 | 6.7% |
| 0 | 149228 | 6.1% |
| 2 | 136060 | 5.6% |
| 1 | 131758 | 5.4% |
Latin
| Value | Count | Frequency (%) |
| i | 4 | |
| a | 4 | |
| A | 2 | |
| n | 2 | |
| m | 2 | |
| l | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2426826 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 429068 | |
| 3 | 397892 | |
| 7 | 294264 | |
| 5 | 278623 | |
| 9 | 230925 | |
| 8 | 216518 | |
| 6 | 162474 | 6.7% |
| 0 | 149228 | 6.1% |
| 2 | 136060 | 5.6% |
| 1 | 131758 | 5.4% |
| Other values (6) | 16 | < 0.1% |
genusKey
Text
Missing 
| Distinct | 35818 |
|---|---|
| Distinct (%) | 6.1% |
| Missing | 19883 |
| Missing (%) | 3.3% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 7.009882974 |
| Min length | 7 |
Unique
| Unique | 11846 ? |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | 1312361 |
|---|---|
| 2nd row | 1851754 |
| 3rd row | 7876391 |
| 4th row | 1422438 |
| 5th row | 4988347 |
| Value | Count | Frequency (%) |
| 1340278 | 62386 | 10.7% |
| 1342048 | 11739 | 2.0% |
| 1422607 | 8660 | 1.5% |
| 1422099 | 7903 | 1.4% |
| 1879915 | 7885 | 1.3% |
| 1423281 | 7465 | 1.3% |
| 1428195 | 6026 | 1.0% |
| 1334757 | 4967 | 0.8% |
| 1428967 | 4175 | 0.7% |
| 1423980 | 4149 | 0.7% |
| Other values (35808) | 459388 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 731946 | |
| 4 | 516466 | |
| 2 | 477202 | |
| 0 | 399327 | |
| 3 | 388191 | |
| 7 | 374798 | |
| 8 | 359928 | |
| 9 | 304581 | |
| 6 | 276553 | 6.7% |
| 5 | 269968 | 6.6% |
| Other values (8) | 20 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4098960 | |
| Lowercase Letter | 18 | < 0.1% |
| Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 731946 | |
| 4 | 516466 | |
| 2 | 477202 | |
| 0 | 399327 | |
| 3 | 388191 | |
| 7 | 374798 | |
| 8 | 359928 | |
| 9 | 304581 | |
| 6 | 276553 | 6.7% |
| 5 | 269968 | 6.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 4 | |
| o | 4 | |
| t | 2 | |
| h | 2 | |
| p | 2 | |
| d | 2 | |
| a | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4098960 | |
| Latin | 20 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 731946 | |
| 4 | 516466 | |
| 2 | 477202 | |
| 0 | 399327 | |
| 3 | 388191 | |
| 7 | 374798 | |
| 8 | 359928 | |
| 9 | 304581 | |
| 6 | 276553 | 6.7% |
| 5 | 269968 | 6.6% |
Latin
| Value | Count | Frequency (%) |
| r | 4 | |
| o | 4 | |
| A | 2 | |
| t | 2 | |
| h | 2 | |
| p | 2 | |
| d | 2 | |
| a | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4098980 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 731946 | |
| 4 | 516466 | |
| 2 | 477202 | |
| 0 | 399327 | |
| 3 | 388191 | |
| 7 | 374798 | |
| 8 | 359928 | |
| 9 | 304581 | |
| 6 | 276553 | 6.7% |
| 5 | 269968 | 6.6% |
| Other values (8) | 20 | < 0.1% |
subgenusKey
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 604624 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Insecta |
|---|---|
| 2nd row | Insecta |
| Value | Count | Frequency (%) |
| insecta | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 2 | |
| n | 2 | |
| s | 2 | |
| e | 2 | |
| c | 2 | |
| t | 2 | |
| a | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12 | |
| Uppercase Letter | 2 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 2 | |
| s | 2 | |
| e | 2 | |
| c | 2 | |
| t | 2 | |
| a | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 2 | |
| n | 2 | |
| s | 2 | |
| e | 2 | |
| c | 2 | |
| t | 2 | |
| a | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 2 | |
| n | 2 | |
| s | 2 | |
| e | 2 | |
| c | 2 | |
| t | 2 | |
| a | 2 |
speciesKey
Text
Missing 
| Distinct | 169008 |
|---|---|
| Distinct (%) | 34.1% |
| Missing | 109501 |
| Missing (%) | 18.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 7 |
| Mean length | 7.040145418 |
| Min length | 7 |
Unique
| Unique | 121718 ? |
|---|---|
| Unique (%) | 24.6% |
Sample
| 1st row | 1313073 |
|---|---|
| 2nd row | 5122189 |
| 3rd row | 1939887 |
| 4th row | 1422444 |
| 5th row | 4988370 |
| Value | Count | Frequency (%) |
| 1340525 | 6265 | 1.3% |
| 1340393 | 4073 | 0.8% |
| 10409744 | 3623 | 0.7% |
| 1340467 | 3343 | 0.7% |
| 1340350 | 3129 | 0.6% |
| 1341979 | 2431 | 0.5% |
| 1340485 | 2119 | 0.4% |
| 1340382 | 1985 | 0.4% |
| 1419322 | 1947 | 0.4% |
| 1423305 | 1920 | 0.4% |
| Other values (168998) | 464290 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 616872 | |
| 4 | 435743 | |
| 0 | 362737 | |
| 3 | 350994 | |
| 2 | 349089 | |
| 5 | 335200 | |
| 9 | 275869 | |
| 8 | 271905 | |
| 7 | 255954 | |
| 6 | 231368 | 6.6% |
| Other values (12) | 21 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3485731 | |
| Lowercase Letter | 19 | < 0.1% |
| Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 616872 | |
| 4 | 435743 | |
| 0 | 362737 | |
| 3 | 350994 | |
| 2 | 349089 | |
| 5 | 335200 | |
| 9 | 275869 | |
| 8 | 271905 | |
| 7 | 255954 | |
| 6 | 231368 | 6.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4 | |
| o | 3 | |
| p | 2 | |
| t | 2 | |
| r | 2 | |
| a | 2 | |
| l | 1 | 5.3% |
| y | 1 | 5.3% |
| m | 1 | 5.3% |
| n | 1 | 5.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 | |
| H | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3485731 | |
| Latin | 21 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 4 | |
| o | 3 | |
| p | 2 | |
| t | 2 | |
| r | 2 | |
| a | 2 | |
| l | 1 | 4.8% |
| C | 1 | 4.8% |
| H | 1 | 4.8% |
| y | 1 | 4.8% |
| Other values (2) | 2 |
Common
| Value | Count | Frequency (%) |
| 1 | 616872 | |
| 4 | 435743 | |
| 0 | 362737 | |
| 3 | 350994 | |
| 2 | 349089 | |
| 5 | 335200 | |
| 9 | 275869 | |
| 8 | 271905 | |
| 7 | 255954 | |
| 6 | 231368 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3485752 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 616872 | |
| 4 | 435743 | |
| 0 | 362737 | |
| 3 | 350994 | |
| 2 | 349089 | |
| 5 | 335200 | |
| 9 | 275869 | |
| 8 | 271905 | |
| 7 | 255954 | |
| 6 | 231368 | 6.6% |
| Other values (12) | 21 | < 0.1% |
species
Text
Missing 
| Distinct | 168987 |
|---|---|
| Distinct (%) | 34.1% |
| Missing | 109503 |
| Missing (%) | 18.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 32 |
| Mean length | 18.63352945 |
| Min length | 6 |
Unique
| Unique | 121697 ? |
|---|---|
| Unique (%) | 24.6% |
Sample
| 1st row | Camponotus rufoglaucus |
|---|---|
| 2nd row | Athrips mesoleuca |
| 3rd row | Paranthrene asilipennis |
| 4th row | Acanthagrion trilobatum |
| 5th row | Calathus ingratus |
| Value | Count | Frequency (%) |
| bombus | 51714 | 5.2% |
| xylocopa | 9795 | 1.0% |
| argia | 8430 | 0.9% |
| enallagma | 7850 | 0.8% |
| crambus | 7738 | 0.8% |
| ischnura | 7433 | 0.8% |
| sylvicola | 6290 | 0.6% |
| sympetrum | 5960 | 0.6% |
| apis | 4956 | 0.5% |
| lestes | 4143 | 0.4% |
| Other values (101139) | 875937 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1016887 | 11.0% |
| i | 815089 | 8.8% |
| s | 718014 | 7.8% |
| e | 632080 | 6.9% |
| o | 593441 | 6.4% |
| r | 547183 | 5.9% |
| l | 514243 | 5.6% |
| 495123 | 5.4% | |
| u | 469909 | 5.1% |
| n | 449274 | 4.9% |
| Other values (44) | 2974646 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8235554 | |
| Space Separator | 495123 | 5.4% |
| Uppercase Letter | 495123 | 5.4% |
| Dash Punctuation | 89 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1016887 | |
| i | 815089 | 9.9% |
| s | 718014 | 8.7% |
| e | 632080 | 7.7% |
| o | 593441 | 7.2% |
| r | 547183 | 6.6% |
| l | 514243 | 6.2% |
| u | 469909 | 5.7% |
| n | 449274 | 5.5% |
| t | 427269 | 5.2% |
| Other values (16) | 2052165 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 64094 | |
| P | 59463 | |
| C | 54933 | |
| A | 54809 | |
| E | 36639 | 7.4% |
| S | 29916 | 6.0% |
| L | 25361 | 5.1% |
| H | 23318 | 4.7% |
| M | 22563 | 4.6% |
| T | 21616 | 4.4% |
| Other values (16) | 102411 |
Space Separator
| Value | Count | Frequency (%) |
| 495123 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 89 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8730677 | |
| Common | 495212 | 5.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1016887 | 11.6% |
| i | 815089 | 9.3% |
| s | 718014 | 8.2% |
| e | 632080 | 7.2% |
| o | 593441 | 6.8% |
| r | 547183 | 6.3% |
| l | 514243 | 5.9% |
| u | 469909 | 5.4% |
| n | 449274 | 5.1% |
| t | 427269 | 4.9% |
| Other values (42) | 2547288 |
Common
| Value | Count | Frequency (%) |
| 495123 | ||
| - | 89 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9225889 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1016887 | 11.0% |
| i | 815089 | 8.8% |
| s | 718014 | 7.8% |
| e | 632080 | 6.9% |
| o | 593441 | 6.4% |
| r | 547183 | 5.9% |
| l | 514243 | 5.6% |
| 495123 | 5.4% | |
| u | 469909 | 5.1% |
| n | 449274 | 4.9% |
| Other values (44) | 2974646 |
| Distinct | 188378 |
|---|---|
| Distinct (%) | 31.4% |
| Missing | 4646 |
| Missing (%) | 0.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 239 |
|---|---|
| Median length | 106 |
| Mean length | 31.58040101 |
| Min length | 5 |
Unique
| Unique | 134599 ? |
|---|---|
| Unique (%) | 22.4% |
Sample
| 1st row | Camponotus rufoglaucus var. rufigenis Forel |
|---|---|
| 2nd row | Athrips mesoleuca Lower, 1900 |
| 3rd row | Paranthrene asilipennis (Boisduval, 1832) |
| 4th row | Acanthagrion trilobatum Leonard, 1977 |
| 5th row | Calathus ingratus Dejean, 1828 |
| Value | Count | Frequency (%) |
| bombus | 62386 | 2.7% |
| 28889 | 1.2% | |
| hagen | 24360 | 1.0% |
| cresson | 24243 | 1.0% |
| 1861 | 18841 | 0.8% |
| fabricius | 17279 | 0.7% |
| 1863 | 16815 | 0.7% |
| selys | 16399 | 0.7% |
| say | 15686 | 0.7% |
| latreille | 15381 | 0.7% |
| Other values (114566) | 2087838 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1728137 | 9.1% | |
| a | 1474905 | 7.8% |
| e | 1198055 | 6.3% |
| i | 1144904 | 6.0% |
| s | 1048999 | 5.5% |
| r | 967679 | 5.1% |
| o | 892139 | 4.7% |
| l | 791989 | 4.2% |
| n | 763156 | 4.0% |
| 1 | 665031 | 3.5% |
| Other values (98) | 8272615 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12670283 | |
| Decimal Number | 2280664 | 12.0% |
| Space Separator | 1728137 | 9.1% |
| Uppercase Letter | 1241501 | 6.6% |
| Other Punctuation | 615070 | 3.2% |
| Close Punctuation | 203519 | 1.1% |
| Open Punctuation | 203519 | 1.1% |
| Dash Punctuation | 4916 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1474905 | |
| e | 1198055 | 9.5% |
| i | 1144904 | 9.0% |
| s | 1048999 | 8.3% |
| r | 967679 | 7.6% |
| o | 892139 | 7.0% |
| l | 791989 | 6.3% |
| n | 763156 | 6.0% |
| u | 647333 | 5.1% |
| t | 632706 | 5.0% |
| Other values (47) | 3108418 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 146954 | |
| B | 129211 | 10.4% |
| S | 114140 | 9.2% |
| P | 91093 | 7.3% |
| A | 85530 | 6.9% |
| H | 84070 | 6.8% |
| L | 83036 | 6.7% |
| M | 65023 | 5.2% |
| D | 57691 | 4.6% |
| E | 51249 | 4.1% |
| Other values (23) | 333504 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 665031 | |
| 8 | 396544 | |
| 9 | 315091 | |
| 7 | 168448 | 7.4% |
| 3 | 135245 | 5.9% |
| 6 | 131330 | 5.8% |
| 0 | 130379 | 5.7% |
| 2 | 125887 | 5.5% |
| 5 | 111491 | 4.9% |
| 4 | 101218 | 4.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 572025 | |
| & | 28889 | 4.7% |
| . | 13983 | 2.3% |
| ' | 173 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1728137 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 203519 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 203519 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4916 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13911784 | |
| Common | 5035825 | 26.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1474905 | 10.6% |
| e | 1198055 | 8.6% |
| i | 1144904 | 8.2% |
| s | 1048999 | 7.5% |
| r | 967679 | 7.0% |
| o | 892139 | 6.4% |
| l | 791989 | 5.7% |
| n | 763156 | 5.5% |
| u | 647333 | 4.7% |
| t | 632706 | 4.5% |
| Other values (80) | 4349919 |
Common
| Value | Count | Frequency (%) |
| 1728137 | ||
| 1 | 665031 | 13.2% |
| , | 572025 | 11.4% |
| 8 | 396544 | 7.9% |
| 9 | 315091 | 6.3% |
| ) | 203519 | 4.0% |
| ( | 203519 | 4.0% |
| 7 | 168448 | 3.3% |
| 3 | 135245 | 2.7% |
| 6 | 131330 | 2.6% |
| Other values (8) | 516936 | 10.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18921976 | |
| None | 25633 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1728137 | 9.1% | |
| a | 1474905 | 7.8% |
| e | 1198055 | 6.3% |
| i | 1144904 | 6.1% |
| s | 1048999 | 5.5% |
| r | 967679 | 5.1% |
| o | 892139 | 4.7% |
| l | 791989 | 4.2% |
| n | 763156 | 4.0% |
| 1 | 665031 | 3.5% |
| Other values (60) | 8246982 |
None
| Value | Count | Frequency (%) |
| é | 9332 | |
| ü | 5958 | |
| ö | 3342 | 13.0% |
| å | 1810 | 7.1% |
| á | 1321 | 5.2% |
| ä | 1318 | 5.1% |
| ç | 861 | 3.4% |
| è | 779 | 3.0% |
| ó | 203 | 0.8% |
| í | 132 | 0.5% |
| Other values (28) | 577 | 2.3% |
| Distinct | 245043 |
|---|---|
| Distinct (%) | 40.8% |
| Missing | 4630 |
| Missing (%) | 0.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 68 |
|---|---|
| Median length | 61 |
| Mean length | 20.7704068 |
| Min length | 3 |
Unique
| Unique | 201366 ? |
|---|---|
| Unique (%) | 33.6% |
Sample
| 1st row | Camponotus (Myrmosericus) rufoglaucus cinctella var. rufigenis |
|---|---|
| 2nd row | Athrips mesoleuca |
| 3rd row | Paranthrene asilipennis |
| 4th row | Acanthagrion trilobatum |
| 5th row | Calathus nanulus |
| Value | Count | Frequency (%) |
| bombus | 69588 | 5.3% |
| sp | 44392 | 3.4% |
| pyrobombus | 21248 | 1.6% |
| xylocopa | 12219 | 0.9% |
| unidentified | 9028 | 0.7% |
| argia | 8663 | 0.7% |
| apis | 8601 | 0.6% |
| enallagma | 7977 | 0.6% |
| crambus | 7970 | 0.6% |
| ischnura | 7456 | 0.6% |
| Other values (130808) | 1127237 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1253913 | 10.1% |
| i | 1043196 | 8.4% |
| s | 971230 | 7.8% |
| o | 842744 | 6.8% |
| e | 820779 | 6.6% |
| 724383 | 5.8% | |
| r | 712701 | 5.7% |
| l | 623014 | 5.0% |
| u | 614900 | 4.9% |
| n | 589792 | 4.7% |
| Other values (72) | 4265509 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10813409 | |
| Space Separator | 724383 | 5.8% |
| Uppercase Letter | 692082 | 5.6% |
| Open Punctuation | 92264 | 0.7% |
| Close Punctuation | 92262 | 0.7% |
| Other Punctuation | 46446 | 0.4% |
| Decimal Number | 742 | < 0.1% |
| Connector Punctuation | 312 | < 0.1% |
| Dash Punctuation | 259 | < 0.1% |
| Final Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1253913 | |
| i | 1043196 | 9.6% |
| s | 971230 | 9.0% |
| o | 842744 | 7.8% |
| e | 820779 | 7.6% |
| r | 712701 | 6.6% |
| l | 623014 | 5.8% |
| u | 614900 | 5.7% |
| n | 589792 | 5.5% |
| t | 542837 | 5.0% |
| Other values (18) | 2798303 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 97574 | |
| B | 85590 | |
| A | 75780 | |
| C | 69816 | |
| S | 43668 | 6.3% |
| E | 42642 | 6.2% |
| L | 33323 | 4.8% |
| M | 31758 | 4.6% |
| T | 31185 | 4.5% |
| H | 29102 | 4.2% |
| Other values (16) | 151644 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 216 | |
| 9 | 110 | |
| 0 | 93 | |
| 2 | 79 | 10.6% |
| 3 | 67 | 9.0% |
| 4 | 55 | 7.4% |
| 6 | 44 | 5.9% |
| 5 | 30 | 4.0% |
| 7 | 30 | 4.0% |
| 8 | 18 | 2.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 46196 | |
| ? | 109 | 0.2% |
| " | 84 | 0.2% |
| # | 34 | 0.1% |
| / | 14 | < 0.1% |
| , | 4 | < 0.1% |
| ; | 2 | < 0.1% |
| ' | 2 | < 0.1% |
| ! | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 92206 | |
| [ | 58 | 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 92204 | |
| ] | 58 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 724383 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 312 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 259 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11505491 | |
| Common | 956670 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1253913 | 10.9% |
| i | 1043196 | 9.1% |
| s | 971230 | 8.4% |
| o | 842744 | 7.3% |
| e | 820779 | 7.1% |
| r | 712701 | 6.2% |
| l | 623014 | 5.4% |
| u | 614900 | 5.3% |
| n | 589792 | 5.1% |
| t | 542837 | 4.7% |
| Other values (44) | 3490385 |
Common
| Value | Count | Frequency (%) |
| 724383 | ||
| ( | 92206 | 9.6% |
| ) | 92204 | 9.6% |
| . | 46196 | 4.8% |
| _ | 312 | < 0.1% |
| - | 259 | < 0.1% |
| 1 | 216 | < 0.1% |
| 9 | 110 | < 0.1% |
| ? | 109 | < 0.1% |
| 0 | 93 | < 0.1% |
| Other values (18) | 582 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12462139 | |
| None | 21 | < 0.1% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1253913 | 10.1% |
| i | 1043196 | 8.4% |
| s | 971230 | 7.8% |
| o | 842744 | 6.8% |
| e | 820779 | 6.6% |
| 724383 | 5.8% | |
| r | 712701 | 5.7% |
| l | 623014 | 5.0% |
| u | 614900 | 4.9% |
| n | 589792 | 4.7% |
| Other values (69) | 4265487 |
None
| Value | Count | Frequency (%) |
| ö | 19 | |
| ñ | 2 | 9.5% |
Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
protocol
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EML |
|---|---|
| 2nd row | EML |
| 3rd row | EML |
| 4th row | EML |
| 5th row | EML |
| Value | Count | Frequency (%) |
| eml | 604622 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 604622 | |
| M | 604622 | |
| L | 604622 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1813866 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 604622 | |
| M | 604622 | |
| L | 604622 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1813866 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 604622 | |
| M | 604622 | |
| L | 604622 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1813866 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 604622 | |
| M | 604622 | |
| L | 604622 |
lastParsed
Text
| Distinct | 186894 |
|---|---|
| Distinct (%) | 30.9% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.9958007 |
| Min length | 7 |
Unique
| Unique | 38992 ? |
|---|---|
| Unique (%) | 6.4% |
Sample
| 1st row | 2024-12-02T13:57:44.315Z |
|---|---|
| 2nd row | 2024-12-02T13:57:18.321Z |
| 3rd row | 2024-12-02T13:59:05.381Z |
| 4th row | 2024-12-02T13:57:22.450Z |
| 5th row | 2024-12-02T13:57:21.275Z |
| Value | Count | Frequency (%) |
| 2024-12-02t13:57:45.539z | 16 | < 0.1% |
| 2024-12-02t13:57:59.931z | 16 | < 0.1% |
| 2024-12-02t13:57:53.908z | 16 | < 0.1% |
| 2024-12-02t13:57:26.378z | 16 | < 0.1% |
| 2024-12-02t13:57:29.420z | 15 | < 0.1% |
| 2024-12-02t13:56:43.735z | 15 | < 0.1% |
| 2024-12-02t13:57:51.108z | 15 | < 0.1% |
| 2024-12-02t13:58:53.448z | 15 | < 0.1% |
| 2024-12-02t13:56:41.760z | 15 | < 0.1% |
| 2024-12-02t13:57:19.226z | 15 | < 0.1% |
| Other values (186884) | 604470 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2760432 | |
| 0 | 1532584 | |
| 1 | 1525143 | |
| - | 1209244 | |
| : | 1209244 | |
| 4 | 972748 | 6.7% |
| 5 | 960823 | 6.6% |
| 3 | 957684 | 6.6% |
| T | 604623 | 4.2% |
| Z | 604622 | 4.2% |
| Other values (19) | 2171290 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10276693 | |
| Other Punctuation | 1813239 | 12.5% |
| Uppercase Letter | 1209246 | 8.3% |
| Dash Punctuation | 1209244 | 8.3% |
| Lowercase Letter | 15 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2 | |
| o | 2 | |
| g | 1 | 6.7% |
| d | 1 | 6.7% |
| e | 1 | 6.7% |
| m | 1 | 6.7% |
| a | 1 | 6.7% |
| p | 1 | 6.7% |
| h | 1 | 6.7% |
| y | 1 | 6.7% |
| Other values (3) | 3 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2760432 | |
| 0 | 1532584 | |
| 1 | 1525143 | |
| 4 | 972748 | 9.5% |
| 5 | 960823 | 9.3% |
| 3 | 957684 | 9.3% |
| 7 | 464169 | 4.5% |
| 9 | 387034 | 3.8% |
| 6 | 364187 | 3.5% |
| 8 | 351889 | 3.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 604623 | |
| Z | 604622 | |
| A | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1209244 | |
| . | 603995 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1209244 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13299176 | |
| Latin | 1209261 | 8.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 604623 | |
| Z | 604622 | |
| r | 2 | < 0.1% |
| o | 2 | < 0.1% |
| g | 1 | < 0.1% |
| d | 1 | < 0.1% |
| e | 1 | < 0.1% |
| m | 1 | < 0.1% |
| a | 1 | < 0.1% |
| A | 1 | < 0.1% |
| Other values (6) | 6 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 2 | 2760432 | |
| 0 | 1532584 | |
| 1 | 1525143 | |
| - | 1209244 | |
| : | 1209244 | |
| 4 | 972748 | 7.3% |
| 5 | 960823 | 7.2% |
| 3 | 957684 | 7.2% |
| . | 603995 | 4.5% |
| 7 | 464169 | 3.5% |
| Other values (3) | 1103110 | 8.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14508437 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2760432 | |
| 0 | 1532584 | |
| 1 | 1525143 | |
| - | 1209244 | |
| : | 1209244 | |
| 4 | 972748 | 6.7% |
| 5 | 960823 | 6.6% |
| 3 | 957684 | 6.6% |
| T | 604623 | 4.2% |
| Z | 604622 | 4.2% |
| Other values (19) | 2171290 |
lastCrawled
Text
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99994873 |
| Min length | 7 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2024-12-02T11:48:23.416Z |
|---|---|
| 2nd row | 2024-12-02T11:48:23.416Z |
| 3rd row | 2024-12-02T11:48:23.416Z |
| 4th row | 2024-12-02T11:48:23.416Z |
| 5th row | 2024-12-02T11:48:23.416Z |
| Value | Count | Frequency (%) |
| 2024-12-02t11:48:23.416z | 604622 | |
| trogoderma | 1 | < 0.1% |
| aphytis | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 3023110 | |
| 1 | 2418488 | |
| 4 | 1813866 | |
| 0 | 1209244 | 8.3% |
| - | 1209244 | 8.3% |
| : | 1209244 | 8.3% |
| T | 604623 | 4.2% |
| 8 | 604622 | 4.2% |
| 3 | 604622 | 4.2% |
| . | 604622 | 4.2% |
| Other values (16) | 1209260 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10278574 | |
| Other Punctuation | 1813866 | 12.5% |
| Uppercase Letter | 1209246 | 8.3% |
| Dash Punctuation | 1209244 | 8.3% |
| Lowercase Letter | 15 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 2 | |
| r | 2 | |
| g | 1 | 6.7% |
| d | 1 | 6.7% |
| e | 1 | 6.7% |
| m | 1 | 6.7% |
| a | 1 | 6.7% |
| p | 1 | 6.7% |
| h | 1 | 6.7% |
| y | 1 | 6.7% |
| Other values (3) | 3 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 3023110 | |
| 1 | 2418488 | |
| 4 | 1813866 | |
| 0 | 1209244 | 11.8% |
| 8 | 604622 | 5.9% |
| 3 | 604622 | 5.9% |
| 6 | 604622 | 5.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 604623 | |
| Z | 604622 | |
| A | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1209244 | |
| . | 604622 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1209244 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13301684 | |
| Latin | 1209261 | 8.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 604623 | |
| Z | 604622 | |
| o | 2 | < 0.1% |
| r | 2 | < 0.1% |
| g | 1 | < 0.1% |
| d | 1 | < 0.1% |
| e | 1 | < 0.1% |
| m | 1 | < 0.1% |
| a | 1 | < 0.1% |
| A | 1 | < 0.1% |
| Other values (6) | 6 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 2 | 3023110 | |
| 1 | 2418488 | |
| 4 | 1813866 | |
| 0 | 1209244 | 9.1% |
| - | 1209244 | 9.1% |
| : | 1209244 | 9.1% |
| 8 | 604622 | 4.5% |
| 3 | 604622 | 4.5% |
| . | 604622 | 4.5% |
| 6 | 604622 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14510945 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 3023110 | |
| 1 | 2418488 | |
| 4 | 1813866 | |
| 0 | 1209244 | 8.3% |
| - | 1209244 | 8.3% |
| : | 1209244 | 8.3% |
| T | 604623 | 4.2% |
| 8 | 604622 | 4.2% |
| 3 | 604622 | 4.2% |
| . | 604622 | 4.2% |
| Other values (16) | 1209260 |
repatriated
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 162658 |
| Missing (%) | 26.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.492994968 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | true |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| true | 224080 | |
| false | 217888 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 441968 | |
| t | 224080 | |
| r | 224080 | |
| u | 224080 | |
| f | 217888 | |
| a | 217888 | |
| l | 217888 | |
| s | 217888 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1985760 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 441968 | |
| t | 224080 | |
| r | 224080 | |
| u | 224080 | |
| f | 217888 | |
| a | 217888 | |
| l | 217888 | |
| s | 217888 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1985760 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 441968 | |
| t | 224080 | |
| r | 224080 | |
| u | 224080 | |
| f | 217888 | |
| a | 217888 | |
| l | 217888 | |
| s | 217888 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1985760 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 441968 | |
| t | 224080 | |
| r | 224080 | |
| u | 224080 | |
| f | 217888 | |
| a | 217888 | |
| l | 217888 | |
| s | 217888 |
projectId
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604625 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | roseni |
|---|
| Value | Count | Frequency (%) |
| roseni | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 1 | |
| o | 1 | |
| s | 1 | |
| e | 1 | |
| n | 1 | |
| i | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 1 | |
| o | 1 | |
| s | 1 | |
| e | 1 | |
| n | 1 | |
| i | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 1 | |
| o | 1 | |
| s | 1 | |
| e | 1 | |
| n | 1 | |
| i | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 1 | |
| o | 1 | |
| s | 1 | |
| e | 1 | |
| n | 1 | |
| i | 1 |
isSequenced
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 604622 |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 604622 | |
| a | 604622 | |
| l | 604622 | |
| s | 604622 | |
| e | 604622 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3023110 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 604622 | |
| a | 604622 | |
| l | 604622 | |
| s | 604622 | |
| e | 604622 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3023110 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| f | 604622 | |
| a | 604622 | |
| l | 604622 | |
| s | 604622 | |
| e | 604622 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3023110 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| f | 604622 | |
| a | 604622 | |
| l | 604622 | |
| s | 604622 | |
| e | 604622 |
gbifRegion
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 163113 |
| Missing (%) | 27.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 11.14388478 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | LATIN_AMERICA |
| 3rd row | NORTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 234151 | |
| latin_america | 104373 | |
| asia | 55886 | 12.7% |
| africa | 22020 | 5.0% |
| oceania | 13164 | 3.0% |
| europe | 11911 | 2.7% |
| antarctica | 8 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 963585 | |
| R | 606614 | |
| I | 533975 | |
| E | 375510 | 7.6% |
| C | 373724 | 7.6% |
| N | 351696 | 7.1% |
| T | 338540 | 6.9% |
| _ | 338524 | 6.9% |
| M | 338524 | 6.9% |
| O | 259226 | 5.3% |
| Other values (6) | 440252 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4581646 | |
| Connector Punctuation | 338524 | 6.9% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 963585 | |
| R | 606614 | |
| I | 533975 | |
| E | 375510 | 8.2% |
| C | 373724 | 8.2% |
| N | 351696 | 7.7% |
| T | 338540 | 7.4% |
| M | 338524 | 7.4% |
| O | 259226 | 5.7% |
| H | 234151 | 5.1% |
| Other values (5) | 206101 | 4.5% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 338524 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4581646 | |
| Common | 338524 | 6.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 963585 | |
| R | 606614 | |
| I | 533975 | |
| E | 375510 | 8.2% |
| C | 373724 | 8.2% |
| N | 351696 | 7.7% |
| T | 338540 | 7.4% |
| M | 338524 | 7.4% |
| O | 259226 | 5.7% |
| H | 234151 | 5.1% |
| Other values (5) | 206101 | 4.5% |
Common
| Value | Count | Frequency (%) |
| _ | 338524 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4920170 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 963585 | |
| R | 606614 | |
| I | 533975 | |
| E | 375510 | 7.6% |
| C | 373724 | 7.6% |
| N | 351696 | 7.1% |
| T | 338540 | 6.9% |
| _ | 338524 | 6.9% |
| M | 338524 | 6.9% |
| O | 259226 | 5.3% |
| Other values (6) | 440252 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 12.99997685 |
| Min length | 5 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | NORTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 604622 | |
| genus | 1 | < 0.1% |
| species | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 1209244 | |
| A | 1209244 | |
| E | 604625 | |
| N | 604623 | |
| I | 604623 | |
| C | 604623 | |
| O | 604622 | |
| T | 604622 | |
| H | 604622 | |
| _ | 604622 | |
| Other values (5) | 604628 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 7255476 | |
| Connector Punctuation | 604622 | 7.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 1209244 | |
| A | 1209244 | |
| E | 604625 | |
| N | 604623 | |
| I | 604623 | |
| C | 604623 | |
| O | 604622 | |
| T | 604622 | |
| H | 604622 | |
| M | 604622 | |
| Other values (4) | 6 | < 0.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 604622 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7255476 | |
| Common | 604622 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 1209244 | |
| A | 1209244 | |
| E | 604625 | |
| N | 604623 | |
| I | 604623 | |
| C | 604623 | |
| O | 604622 | |
| T | 604622 | |
| H | 604622 | |
| M | 604622 | |
| Other values (4) | 6 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| _ | 604622 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7860098 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 1209244 | |
| A | 1209244 | |
| E | 604625 | |
| N | 604623 | |
| I | 604623 | |
| C | 604623 | |
| O | 604622 | |
| T | 604622 | |
| H | 604622 | |
| _ | 604622 | |
| Other values (5) | 604628 |
level0Gid
Text
Missing 
| Distinct | 212 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 288722 |
| Missing (%) | 47.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 11 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | CRI |
|---|---|
| 2nd row | USA |
| 3rd row | USA |
| 4th row | DMA |
| 5th row | CAN |
| Value | Count | Frequency (%) |
| usa | 196159 | |
| can | 14651 | 4.6% |
| mex | 5495 | 1.7% |
| bra | 4604 | 1.5% |
| cri | 4530 | 1.4% |
| chl | 4046 | 1.3% |
| zaf | 3361 | 1.1% |
| ind | 3261 | 1.0% |
| ken | 3246 | 1.0% |
| arg | 3226 | 1.0% |
| Other values (202) | 73325 | 23.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 237901 | |
| U | 211103 | |
| S | 206824 | |
| N | 40079 | 4.2% |
| C | 33198 | 3.5% |
| R | 25245 | 2.7% |
| E | 23068 | 2.4% |
| M | 20525 | 2.2% |
| L | 15881 | 1.7% |
| G | 15551 | 1.6% |
| Other values (19) | 118337 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 947676 | |
| Decimal Number | 36 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 237901 | |
| U | 211103 | |
| S | 206824 | |
| N | 40079 | 4.2% |
| C | 33198 | 3.5% |
| R | 25245 | 2.7% |
| E | 23068 | 2.4% |
| M | 20525 | 2.2% |
| L | 15881 | 1.7% |
| G | 15551 | 1.6% |
| Other values (16) | 118301 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 18 | |
| 7 | 10 | |
| 1 | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 947676 | |
| Common | 36 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 237901 | |
| U | 211103 | |
| S | 206824 | |
| N | 40079 | 4.2% |
| C | 33198 | 3.5% |
| R | 25245 | 2.7% |
| E | 23068 | 2.4% |
| M | 20525 | 2.2% |
| L | 15881 | 1.7% |
| G | 15551 | 1.6% |
| Other values (16) | 118301 |
Common
| Value | Count | Frequency (%) |
| 0 | 18 | |
| 7 | 10 | |
| 1 | 8 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 947712 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 237901 | |
| U | 211103 | |
| S | 206824 | |
| N | 40079 | 4.2% |
| C | 33198 | 3.5% |
| R | 25245 | 2.7% |
| E | 23068 | 2.4% |
| M | 20525 | 2.2% |
| L | 15881 | 1.7% |
| G | 15551 | 1.6% |
| Other values (19) | 118337 |
level0Name
Text
Missing 
| Distinct | 212 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 288722 |
| Missing (%) | 47.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 13 |
| Mean length | 11.1129552 |
| Min length | 4 |
Unique
| Unique | 11 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Costa Rica |
|---|---|
| 2nd row | United States |
| 3rd row | United States |
| 4th row | Dominica |
| 5th row | Canada |
| Value | Count | Frequency (%) |
| united | 198236 | |
| states | 196177 | |
| canada | 14651 | 2.7% |
| méxico | 5495 | 1.0% |
| brazil | 4604 | 0.9% |
| costa | 4530 | 0.8% |
| rica | 4530 | 0.8% |
| chile | 4046 | 0.7% |
| south | 3768 | 0.7% |
| africa | 3361 | 0.6% |
| Other values (247) | 102079 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 614835 | |
| e | 449723 | |
| a | 371695 | |
| n | 279068 | |
| i | 278571 | |
| d | 238311 | 6.8% |
| 225573 | 6.4% | |
| s | 217681 | 6.2% |
| S | 208349 | 5.9% |
| U | 199271 | 5.7% |
| Other values (52) | 427550 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2745598 | |
| Uppercase Letter | 538709 | 15.3% |
| Space Separator | 225573 | 6.4% |
| Other Punctuation | 734 | < 0.1% |
| Dash Punctuation | 11 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 614835 | |
| e | 449723 | |
| a | 371695 | |
| n | 279068 | |
| i | 278571 | |
| d | 238311 | 8.7% |
| s | 217681 | 7.9% |
| o | 45403 | 1.7% |
| r | 41231 | 1.5% |
| l | 34955 | 1.3% |
| Other values (21) | 174125 | 6.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 208349 | |
| U | 199271 | |
| C | 29139 | 5.4% |
| M | 10460 | 1.9% |
| A | 10110 | 1.9% |
| G | 8819 | 1.6% |
| P | 8642 | 1.6% |
| R | 8600 | 1.6% |
| B | 8499 | 1.6% |
| I | 8140 | 1.5% |
| Other values (14) | 38680 | 7.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 412 | |
| , | 223 | |
| ' | 99 | 13.5% |
Space Separator
| Value | Count | Frequency (%) |
| 225573 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 11 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3284307 | |
| Common | 226320 | 6.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 614835 | |
| e | 449723 | |
| a | 371695 | |
| n | 279068 | |
| i | 278571 | |
| d | 238311 | 7.3% |
| s | 217681 | 6.6% |
| S | 208349 | 6.3% |
| U | 199271 | 6.1% |
| o | 45403 | 1.4% |
| Other values (45) | 381400 |
Common
| Value | Count | Frequency (%) |
| 225573 | ||
| . | 412 | 0.2% |
| , | 223 | 0.1% |
| ' | 99 | < 0.1% |
| - | 11 | < 0.1% |
| ( | 1 | < 0.1% |
| ) | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3504956 | |
| None | 5671 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 614835 | |
| e | 449723 | |
| a | 371695 | |
| n | 279068 | |
| i | 278571 | |
| d | 238311 | 6.8% |
| 225573 | 6.4% | |
| s | 217681 | 6.2% |
| S | 208349 | 5.9% |
| U | 199271 | 5.7% |
| Other values (47) | 421879 |
None
| Value | Count | Frequency (%) |
| é | 5502 | |
| ô | 99 | 1.7% |
| ç | 56 | 1.0% |
| ã | 7 | 0.1% |
| í | 7 | 0.1% |
level1Gid
Text
Missing 
| Distinct | 1995 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 288806 |
| Missing (%) | 47.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.612196821 |
| Min length | 6 |
Unique
| Unique | 306 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | CRI.2_1 |
|---|---|
| 2nd row | USA.2_1 |
| 3rd row | USA.47_1 |
| 4th row | DMA.4_1 |
| 5th row | CAN.11_1 |
| Value | Count | Frequency (%) |
| usa.5_1 | 21189 | 6.7% |
| usa.6_1 | 19719 | 6.2% |
| usa.47_1 | 14927 | 4.7% |
| usa.3_1 | 11623 | 3.7% |
| usa.44_1 | 9899 | 3.1% |
| usa.21_1 | 8906 | 2.8% |
| usa.10_1 | 8599 | 2.7% |
| usa.15_1 | 7690 | 2.4% |
| usa.48_1 | 6994 | 2.2% |
| can.13_1 | 6708 | 2.1% |
| Other values (1985) | 199566 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 418331 | |
| _ | 315801 | |
| . | 315702 | |
| A | 237898 | |
| U | 211047 | |
| S | 206822 | |
| 4 | 78148 | 3.3% |
| 3 | 75297 | 3.1% |
| 2 | 65418 | 2.7% |
| 5 | 50656 | 2.1% |
| Other values (28) | 428964 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 947481 | |
| Decimal Number | 825100 | |
| Connector Punctuation | 315801 | 13.1% |
| Other Punctuation | 315702 | 13.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 237898 | |
| U | 211047 | |
| S | 206822 | |
| N | 40068 | 4.2% |
| C | 33137 | 3.5% |
| R | 25233 | 2.7% |
| E | 23068 | 2.4% |
| M | 20522 | 2.2% |
| L | 15880 | 1.7% |
| G | 15569 | 1.6% |
| Other values (16) | 118237 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 418331 | |
| 4 | 78148 | 9.5% |
| 3 | 75297 | 9.1% |
| 2 | 65418 | 7.9% |
| 5 | 50656 | 6.1% |
| 6 | 39377 | 4.8% |
| 7 | 29212 | 3.5% |
| 0 | 24377 | 3.0% |
| 9 | 23816 | 2.9% |
| 8 | 20468 | 2.5% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 315801 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 315702 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1456603 | |
| Latin | 947481 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 237898 | |
| U | 211047 | |
| S | 206822 | |
| N | 40068 | 4.2% |
| C | 33137 | 3.5% |
| R | 25233 | 2.7% |
| E | 23068 | 2.4% |
| M | 20522 | 2.2% |
| L | 15880 | 1.7% |
| G | 15569 | 1.6% |
| Other values (16) | 118237 |
Common
| Value | Count | Frequency (%) |
| 1 | 418331 | |
| _ | 315801 | |
| . | 315702 | |
| 4 | 78148 | 5.4% |
| 3 | 75297 | 5.2% |
| 2 | 65418 | 4.5% |
| 5 | 50656 | 3.5% |
| 6 | 39377 | 2.7% |
| 7 | 29212 | 2.0% |
| 0 | 24377 | 1.7% |
| Other values (2) | 44284 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2404084 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 418331 | |
| _ | 315801 | |
| . | 315702 | |
| A | 237898 | |
| U | 211047 | |
| S | 206822 | |
| 4 | 78148 | 3.3% |
| 3 | 75297 | 3.1% |
| 2 | 65418 | 2.7% |
| 5 | 50656 | 2.1% |
| Other values (28) | 428964 |
level1Name
Text
Missing 
| Distinct | 1914 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 288804 |
| Missing (%) | 47.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 30 |
| Mean length | 8.767492448 |
| Min length | 3 |
Unique
| Unique | 289 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Cartago |
|---|---|
| 2nd row | Alaska |
| 3rd row | Virginia |
| 4th row | Saint John |
| 5th row | Québec |
| Value | Count | Frequency (%) |
| california | 21273 | 5.5% |
| virginia | 20864 | 5.4% |
| colorado | 19719 | 5.1% |
| new | 13960 | 3.6% |
| arizona | 11623 | 3.0% |
| texas | 9899 | 2.5% |
| maryland | 8907 | 2.3% |
| florida | 8599 | 2.2% |
| indiana | 7690 | 2.0% |
| washington | 6994 | 1.8% |
| Other values (2081) | 260344 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 391229 | |
| i | 264505 | 9.6% |
| o | 236806 | 8.6% |
| n | 222321 | 8.0% |
| r | 191355 | 6.9% |
| e | 138677 | 5.0% |
| s | 124817 | 4.5% |
| l | 116315 | 4.2% |
| t | 92138 | 3.3% |
| d | 75412 | 2.7% |
| Other values (107) | 915392 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2293177 | |
| Uppercase Letter | 391604 | 14.1% |
| Space Separator | 74050 | 2.7% |
| Dash Punctuation | 8392 | 0.3% |
| Other Punctuation | 1708 | 0.1% |
| Modifier Symbol | 36 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 391229 | |
| i | 264505 | |
| o | 236806 | |
| n | 222321 | |
| r | 191355 | |
| e | 138677 | 6.0% |
| s | 124817 | 5.4% |
| l | 116315 | 5.1% |
| t | 92138 | 4.0% |
| d | 75412 | 3.3% |
| Other values (64) | 439602 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 66401 | |
| M | 38956 | 9.9% |
| N | 30780 | 7.9% |
| A | 25773 | 6.6% |
| V | 24825 | 6.3% |
| W | 24259 | 6.2% |
| T | 20704 | 5.3% |
| S | 18532 | 4.7% |
| I | 16011 | 4.1% |
| O | 15504 | 4.0% |
| Other values (25) | 109859 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 823 | |
| . | 405 | |
| / | 387 | |
| , | 58 | 3.4% |
| ! | 35 | 2.0% |
Space Separator
| Value | Count | Frequency (%) |
| 74050 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8392 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 36 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2684781 | |
| Common | 84186 | 3.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 391229 | |
| i | 264505 | 9.9% |
| o | 236806 | 8.8% |
| n | 222321 | 8.3% |
| r | 191355 | 7.1% |
| e | 138677 | 5.2% |
| s | 124817 | 4.6% |
| l | 116315 | 4.3% |
| t | 92138 | 3.4% |
| d | 75412 | 2.8% |
| Other values (99) | 831206 |
Common
| Value | Count | Frequency (%) |
| 74050 | ||
| - | 8392 | 10.0% |
| ' | 823 | 1.0% |
| . | 405 | 0.5% |
| / | 387 | 0.5% |
| , | 58 | 0.1% |
| ` | 36 | < 0.1% |
| ! | 35 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2752710 | |
| None | 16178 | 0.6% |
| Latin Ext Additional | 79 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 391229 | |
| i | 264505 | 9.6% |
| o | 236806 | 8.6% |
| n | 222321 | 8.1% |
| r | 191355 | 7.0% |
| e | 138677 | 5.0% |
| s | 124817 | 4.5% |
| l | 116315 | 4.2% |
| t | 92138 | 3.3% |
| d | 75412 | 2.7% |
| Other values (50) | 899135 |
None
| Value | Count | Frequency (%) |
| í | 4068 | |
| á | 3932 | |
| é | 2811 | |
| ü | 1323 | 8.2% |
| ó | 1117 | 6.9% |
| ô | 489 | 3.0% |
| Î | 457 | 2.8% |
| ø | 305 | 1.9% |
| ã | 253 | 1.6% |
| Ñ | 232 | 1.4% |
| Other values (39) | 1191 | 7.4% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ồ | 24 | |
| ẵ | 22 | |
| ằ | 16 | |
| ả | 9 | 11.4% |
| ế | 3 | 3.8% |
| ừ | 3 | 3.8% |
| ọ | 1 | 1.3% |
| ị | 1 | 1.3% |
level2Gid
Text
Missing 
| Distinct | 8078 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 297499 |
| Missing (%) | 49.2% |
| Memory size | 4.6 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 10.27947722 |
| Min length | 7 |
Unique
| Unique | 1940 ? |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | CRI.2.8_1 |
|---|---|
| 2nd row | USA.2.2_1 |
| 3rd row | USA.47.124_1 |
| 4th row | CAN.11.63_1 |
| 5th row | DEU.1.20_1 |
| Value | Count | Frequency (%) |
| usa.6.7_1 | 6808 | 2.2% |
| usa.6.11_1 | 6752 | 2.2% |
| can.13.1_1 | 6708 | 2.2% |
| usa.3.2_1 | 4440 | 1.4% |
| usa.5.55_1 | 3202 | 1.0% |
| usa.47.40_1 | 2960 | 1.0% |
| usa.50.54_1 | 2928 | 1.0% |
| usa.21.15_1 | 2888 | 0.9% |
| usa.21.16_1 | 2564 | 0.8% |
| usa.3.11_1 | 2272 | 0.7% |
| Other values (8068) | 265605 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 614117 | |
| 1 | 522032 | |
| _ | 307127 | |
| A | 235664 | 7.5% |
| U | 210206 | 6.7% |
| S | 205913 | 6.5% |
| 2 | 149564 | 4.7% |
| 3 | 133157 | 4.2% |
| 4 | 124441 | 3.9% |
| 5 | 100316 | 3.2% |
| Other values (28) | 554568 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1314516 | |
| Uppercase Letter | 921345 | |
| Other Punctuation | 614117 | |
| Connector Punctuation | 307127 | 9.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 235664 | |
| U | 210206 | |
| S | 205913 | |
| N | 40022 | 4.3% |
| C | 32403 | 3.5% |
| R | 23329 | 2.5% |
| E | 23052 | 2.5% |
| M | 17539 | 1.9% |
| L | 15130 | 1.6% |
| G | 14115 | 1.5% |
| Other values (16) | 103972 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 522032 | |
| 2 | 149564 | 11.4% |
| 3 | 133157 | 10.1% |
| 4 | 124441 | 9.5% |
| 5 | 100316 | 7.6% |
| 6 | 77578 | 5.9% |
| 7 | 64114 | 4.9% |
| 8 | 49850 | 3.8% |
| 0 | 47336 | 3.6% |
| 9 | 46128 | 3.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 614117 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 307127 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2235760 | |
| Latin | 921345 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 235664 | |
| U | 210206 | |
| S | 205913 | |
| N | 40022 | 4.3% |
| C | 32403 | 3.5% |
| R | 23329 | 2.5% |
| E | 23052 | 2.5% |
| M | 17539 | 1.9% |
| L | 15130 | 1.6% |
| G | 14115 | 1.5% |
| Other values (16) | 103972 |
Common
| Value | Count | Frequency (%) |
| . | 614117 | |
| 1 | 522032 | |
| _ | 307127 | |
| 2 | 149564 | 6.7% |
| 3 | 133157 | 6.0% |
| 4 | 124441 | 5.6% |
| 5 | 100316 | 4.5% |
| 6 | 77578 | 3.5% |
| 7 | 64114 | 2.9% |
| 8 | 49850 | 2.2% |
| Other values (2) | 93464 | 4.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3157105 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 614117 | |
| 1 | 522032 | |
| _ | 307127 | |
| A | 235664 | 7.5% |
| U | 210206 | 6.7% |
| S | 205913 | 6.5% |
| 2 | 149564 | 4.7% |
| 3 | 133157 | 4.2% |
| 4 | 124441 | 3.9% |
| 5 | 100316 | 3.2% |
| Other values (28) | 554568 |
level2Name
Text
Missing 
| Distinct | 6808 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 297510 |
| Missing (%) | 49.2% |
| Memory size | 4.6 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 29 |
| Mean length | 8.485347556 |
| Min length | 1 |
Unique
| Unique | 1657 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | Turrialba |
|---|---|
| 2nd row | Aleutians West |
| 3rd row | Virginia Beach |
| 4th row | Les Collines-de-l'Outaouais |
| 5th row | Karlsruhe (Stadtkreis) |
| Value | Count | Frequency (%) |
| san | 7963 | 2.0% |
| boulder | 6808 | 1.7% |
| clear | 6752 | 1.7% |
| creek | 6752 | 1.7% |
| yukon | 6708 | 1.7% |
| montgomery | 4776 | 1.2% |
| cochise | 4440 | 1.1% |
| of | 3305 | 0.8% |
| tuolumne | 3202 | 0.8% |
| prince | 3200 | 0.8% |
| Other values (7084) | 336748 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 290297 | 11.1% |
| e | 228930 | 8.8% |
| o | 196468 | 7.5% |
| n | 191186 | 7.3% |
| r | 177117 | 6.8% |
| i | 155521 | 6.0% |
| l | 123512 | 4.7% |
| t | 99678 | 3.8% |
| s | 96666 | 3.7% |
| u | 90720 | 3.5% |
| Other values (145) | 955891 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2112006 | |
| Uppercase Letter | 386677 | 14.8% |
| Space Separator | 83538 | 3.2% |
| Other Punctuation | 8799 | 0.3% |
| Dash Punctuation | 7393 | 0.3% |
| Decimal Number | 4058 | 0.2% |
| Open Punctuation | 1892 | 0.1% |
| Close Punctuation | 1491 | 0.1% |
| Math Symbol | 73 | < 0.1% |
| Modifier Symbol | 59 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 290297 | |
| e | 228930 | |
| o | 196468 | |
| n | 191186 | |
| r | 177117 | 8.4% |
| i | 155521 | 7.4% |
| l | 123512 | 5.8% |
| t | 99678 | 4.7% |
| s | 96666 | 4.6% |
| u | 90720 | 4.3% |
| Other values (75) | 461911 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 54909 | |
| S | 35254 | 9.1% |
| B | 29908 | 7.7% |
| M | 28205 | 7.3% |
| P | 23720 | 6.1% |
| L | 18739 | 4.8% |
| T | 17554 | 4.5% |
| G | 16534 | 4.3% |
| W | 16511 | 4.3% |
| A | 15955 | 4.1% |
| Other values (37) | 129388 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1630 | |
| 2 | 403 | 9.9% |
| 8 | 399 | 9.8% |
| 6 | 390 | 9.6% |
| 5 | 327 | 8.1% |
| 7 | 316 | 7.8% |
| 9 | 184 | 4.5% |
| 0 | 152 | 3.7% |
| 4 | 143 | 3.5% |
| 3 | 114 | 2.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4243 | |
| ' | 3725 | |
| / | 407 | 4.6% |
| & | 359 | 4.1% |
| , | 37 | 0.4% |
| ? | 26 | 0.3% |
| # | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 83538 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7393 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1892 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1491 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 73 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 59 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2498683 | |
| Common | 107303 | 4.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 290297 | 11.6% |
| e | 228930 | 9.2% |
| o | 196468 | 7.9% |
| n | 191186 | 7.7% |
| r | 177117 | 7.1% |
| i | 155521 | 6.2% |
| l | 123512 | 4.9% |
| t | 99678 | 4.0% |
| s | 96666 | 3.9% |
| u | 90720 | 3.6% |
| Other values (122) | 848588 |
Common
| Value | Count | Frequency (%) |
| 83538 | ||
| - | 7393 | 6.9% |
| . | 4243 | 4.0% |
| ' | 3725 | 3.5% |
| ( | 1892 | 1.8% |
| 1 | 1630 | 1.5% |
| ) | 1491 | 1.4% |
| / | 407 | 0.4% |
| 2 | 403 | 0.4% |
| 8 | 399 | 0.4% |
| Other values (13) | 2182 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2589999 | |
| None | 15901 | 0.6% |
| Latin Ext Additional | 86 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 290297 | 11.2% |
| e | 228930 | 8.8% |
| o | 196468 | 7.6% |
| n | 191186 | 7.4% |
| r | 177117 | 6.8% |
| i | 155521 | 6.0% |
| l | 123512 | 4.8% |
| t | 99678 | 3.8% |
| s | 96666 | 3.7% |
| u | 90720 | 3.5% |
| Other values (65) | 939904 |
None
| Value | Count | Frequency (%) |
| í | 4250 | |
| é | 2909 | |
| á | 2737 | |
| ó | 2533 | |
| ñ | 880 | 5.5% |
| ú | 424 | 2.7% |
| Ó | 244 | 1.5% |
| ü | 174 | 1.1% |
| ã | 147 | 0.9% |
| ç | 136 | 0.9% |
| Other values (60) | 1467 | 9.2% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ả | 20 | |
| ậ | 18 | |
| ạ | 14 | |
| ừ | 13 | |
| ờ | 10 | |
| ộ | 5 | 5.8% |
| ồ | 2 | 2.3% |
| ẫ | 2 | 2.3% |
| ỷ | 1 | 1.2% |
| ễ | 1 | 1.2% |
level3Gid
Text
Missing 
| Distinct | 4043 |
|---|---|
| Distinct (%) | 6.3% |
| Missing | 540301 |
| Missing (%) | 89.4% |
| Memory size | 4.6 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 15 |
| Mean length | 11.95808784 |
| Min length | 11 |
Unique
| Unique | 1332 ? |
|---|---|
| Unique (%) | 2.1% |
Sample
| 1st row | CRI.2.8.2_1 |
|---|---|
| 2nd row | CAN.11.63.6_1 |
| 3rd row | DEU.1.20.1_1 |
| 4th row | CHE.10.8.10_1 |
| 5th row | ZAF.9.4.1_1 |
| Value | Count | Frequency (%) |
| can.13.1.35_1 | 6689 | 10.4% |
| mmr.14.2.1_1 | 1323 | 2.1% |
| gbr.1.98.1_1 | 1301 | 2.0% |
| sen.1.3.3_1 | 961 | 1.5% |
| ind.31.3.1_1 | 744 | 1.2% |
| deu.1.20.1_1 | 733 | 1.1% |
| can.11.86.2_1 | 690 | 1.1% |
| idn.9.16.3_1 | 658 | 1.0% |
| per.18.1.3_1 | 654 | 1.0% |
| cri.2.7.3_1 | 505 | 0.8% |
| Other values (4033) | 50067 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 192969 | |
| 1 | 144312 | |
| _ | 64323 | 8.4% |
| 3 | 43315 | 5.6% |
| 2 | 37003 | 4.8% |
| N | 29353 | 3.8% |
| C | 28099 | 3.7% |
| A | 23531 | 3.1% |
| 4 | 20955 | 2.7% |
| 5 | 19011 | 2.5% |
| Other values (31) | 166333 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 318943 | |
| Other Punctuation | 192969 | |
| Uppercase Letter | 192933 | |
| Connector Punctuation | 64323 | 8.4% |
| Lowercase Letter | 28 | < 0.1% |
| Dash Punctuation | 8 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 29353 | |
| C | 28099 | |
| A | 23531 | |
| E | 14137 | 7.3% |
| R | 13020 | 6.7% |
| I | 11842 | 6.1% |
| D | 9374 | 4.9% |
| H | 7891 | 4.1% |
| L | 7199 | 3.7% |
| U | 6532 | 3.4% |
| Other values (13) | 41955 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 144312 | |
| 3 | 43315 | 13.6% |
| 2 | 37003 | 11.6% |
| 4 | 20955 | 6.6% |
| 5 | 19011 | 6.0% |
| 6 | 14785 | 4.6% |
| 8 | 11524 | 3.6% |
| 9 | 10979 | 3.4% |
| 7 | 10315 | 3.2% |
| 0 | 6744 | 2.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 8 | |
| a | 8 | |
| b | 6 | |
| d | 4 | |
| e | 2 | 7.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 192969 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 64323 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 576243 | |
| Latin | 192961 | 25.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 29353 | |
| C | 28099 | |
| A | 23531 | |
| E | 14137 | 7.3% |
| R | 13020 | 6.7% |
| I | 11842 | 6.1% |
| D | 9374 | 4.9% |
| H | 7891 | 4.1% |
| L | 7199 | 3.7% |
| U | 6532 | 3.4% |
| Other values (18) | 41983 |
Common
| Value | Count | Frequency (%) |
| . | 192969 | |
| 1 | 144312 | |
| _ | 64323 | 11.2% |
| 3 | 43315 | 7.5% |
| 2 | 37003 | 6.4% |
| 4 | 20955 | 3.6% |
| 5 | 19011 | 3.3% |
| 6 | 14785 | 2.6% |
| 8 | 11524 | 2.0% |
| 9 | 10979 | 1.9% |
| Other values (3) | 17067 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 769204 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 192969 | |
| 1 | 144312 | |
| _ | 64323 | 8.4% |
| 3 | 43315 | 5.6% |
| 2 | 37003 | 4.8% |
| N | 29353 | 3.8% |
| C | 28099 | 3.7% |
| A | 23531 | 3.1% |
| 4 | 20955 | 2.7% |
| 5 | 19011 | 2.5% |
| Other values (31) | 166333 |
level3Name
Text
Missing 
| Distinct | 3911 |
|---|---|
| Distinct (%) | 6.2% |
| Missing | 541181 |
| Missing (%) | 89.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 28 |
| Mean length | 10.42589645 |
| Min length | 2 |
Unique
| Unique | 1274 ? |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | La Isabel |
|---|---|
| 2nd row | Pontiac |
| 3rd row | Karlsruhe |
| 4th row | Mesocco |
| 5th row | Bitou |
| Value | Count | Frequency (%) |
| unorganized | 7206 | 7.7% |
| yukon | 6689 | 7.2% |
| bokpyin | 1323 | 1.4% |
| elmbridge | 1301 | 1.4% |
| san | 1275 | 1.4% |
| thiaroye | 961 | 1.0% |
| n.a | 819 | 0.9% |
| la | 758 | 0.8% |
| coimbatore | 744 | 0.8% |
| karlsruhe | 733 | 0.8% |
| Other values (4216) | 71692 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 74787 | 11.3% |
| n | 56156 | 8.5% |
| o | 51299 | 7.8% |
| e | 42650 | 6.4% |
| r | 41094 | 6.2% |
| i | 40471 | 6.1% |
| 30056 | 4.5% | |
| u | 25946 | 3.9% |
| l | 20742 | 3.1% |
| t | 20128 | 3.0% |
| Other values (114) | 258142 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 519544 | |
| Uppercase Letter | 91006 | 13.8% |
| Space Separator | 30056 | 4.5% |
| Other Punctuation | 11503 | 1.7% |
| Decimal Number | 4837 | 0.7% |
| Dash Punctuation | 1922 | 0.3% |
| Open Punctuation | 1374 | 0.2% |
| Close Punctuation | 1228 | 0.2% |
| Final Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 74787 | |
| n | 56156 | |
| o | 51299 | |
| e | 42650 | 8.2% |
| r | 41094 | 7.9% |
| i | 40471 | 7.8% |
| u | 25946 | 5.0% |
| l | 20742 | 4.0% |
| t | 20128 | 3.9% |
| g | 19485 | 3.8% |
| Other values (58) | 126786 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 7912 | 8.7% |
| B | 7800 | 8.6% |
| Y | 7428 | 8.2% |
| S | 7002 | 7.7% |
| C | 6602 | 7.3% |
| M | 5940 | 6.5% |
| T | 5084 | 5.6% |
| P | 4964 | 5.5% |
| A | 4378 | 4.8% |
| K | 4251 | 4.7% |
| Other values (25) | 29645 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1354 | |
| 9 | 558 | |
| 2 | 551 | |
| 4 | 546 | |
| 3 | 425 | 8.8% |
| 5 | 374 | 7.7% |
| 8 | 363 | 7.5% |
| 6 | 296 | 6.1% |
| 7 | 206 | 4.3% |
| 0 | 164 | 3.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 8166 | |
| . | 3041 | 26.4% |
| / | 160 | 1.4% |
| ' | 116 | 1.0% |
| ! | 17 | 0.1% |
| & | 3 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 30056 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1922 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1374 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1228 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 610550 | |
| Common | 50921 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 74787 | 12.2% |
| n | 56156 | 9.2% |
| o | 51299 | 8.4% |
| e | 42650 | 7.0% |
| r | 41094 | 6.7% |
| i | 40471 | 6.6% |
| u | 25946 | 4.2% |
| l | 20742 | 3.4% |
| t | 20128 | 3.3% |
| g | 19485 | 3.2% |
| Other values (93) | 217792 |
Common
| Value | Count | Frequency (%) |
| 30056 | ||
| , | 8166 | 16.0% |
| . | 3041 | 6.0% |
| - | 1922 | 3.8% |
| ( | 1374 | 2.7% |
| 1 | 1354 | 2.7% |
| ) | 1228 | 2.4% |
| 9 | 558 | 1.1% |
| 2 | 551 | 1.1% |
| 4 | 546 | 1.1% |
| Other values (11) | 2125 | 4.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 657284 | |
| None | 4123 | 0.6% |
| Latin Ext Additional | 63 | < 0.1% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 74787 | 11.4% |
| n | 56156 | 8.5% |
| o | 51299 | 7.8% |
| e | 42650 | 6.5% |
| r | 41094 | 6.3% |
| i | 40471 | 6.2% |
| 30056 | 4.6% | |
| u | 25946 | 3.9% |
| l | 20742 | 3.2% |
| t | 20128 | 3.1% |
| Other values (62) | 253955 |
None
| Value | Count | Frequency (%) |
| ó | 807 | |
| é | 767 | |
| ñ | 452 | |
| í | 450 | |
| ì | 270 | 6.5% |
| á | 263 | 6.4% |
| ê | 149 | 3.6% |
| ä | 137 | 3.3% |
| ü | 100 | 2.4% |
| è | 83 | 2.0% |
| Other values (31) | 645 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ả | 14 | |
| ờ | 12 | |
| ồ | 11 | |
| ắ | 8 | |
| ọ | 6 | |
| ậ | 5 | 7.9% |
| ạ | 2 | 3.2% |
| ộ | 2 | 3.2% |
| ề | 2 | 3.2% |
| ế | 1 | 1.6% |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 1 |
Missing 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 96088 |
| Missing (%) | 15.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 2 |
| Mean length | 2.000086523 |
| Min length | 2 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | NE |
|---|---|
| 2nd row | NE |
| 3rd row | LC |
| 4th row | NE |
| 5th row | NE |
| Value | Count | Frequency (%) |
| ne | 354480 | |
| lc | 142489 | |
| vu | 5303 | 1.0% |
| dd | 2469 | 0.5% |
| cr | 2099 | 0.4% |
| en | 933 | 0.2% |
| nt | 742 | 0.1% |
| ex | 21 | < 0.1% |
| 2024-12-02t13:57:01.149z | 1 | < 0.1% |
| 2024-12-02t13:57:17.314z | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 356155 | |
| E | 355434 | |
| C | 144588 | |
| L | 142489 | |
| V | 5303 | 0.5% |
| U | 5303 | 0.5% |
| D | 4938 | 0.5% |
| R | 2099 | 0.2% |
| T | 744 | 0.1% |
| X | 21 | < 0.1% |
| Other values (12) | 46 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1017076 | |
| Decimal Number | 34 | < 0.1% |
| Other Punctuation | 6 | < 0.1% |
| Dash Punctuation | 4 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 356155 | |
| E | 355434 | |
| C | 144588 | |
| L | 142489 | |
| V | 5303 | 0.5% |
| U | 5303 | 0.5% |
| D | 4938 | 0.5% |
| R | 2099 | 0.2% |
| T | 744 | 0.1% |
| X | 21 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 8 | |
| 2 | 8 | |
| 0 | 5 | |
| 4 | 4 | |
| 3 | 3 | 8.8% |
| 7 | 3 | 8.8% |
| 5 | 2 | 5.9% |
| 9 | 1 | 2.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 4 | |
| . | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1017076 | |
| Common | 44 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 356155 | |
| E | 355434 | |
| C | 144588 | |
| L | 142489 | |
| V | 5303 | 0.5% |
| U | 5303 | 0.5% |
| D | 4938 | 0.5% |
| R | 2099 | 0.2% |
| T | 744 | 0.1% |
| X | 21 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 1 | 8 | |
| 2 | 8 | |
| 0 | 5 | |
| - | 4 | |
| 4 | 4 | |
| : | 4 | |
| 3 | 3 | 6.8% |
| 7 | 3 | 6.8% |
| 5 | 2 | 4.5% |
| . | 2 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1017120 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 356155 | |
| E | 355434 | |
| C | 144588 | |
| L | 142489 | |
| V | 5303 | 0.5% |
| U | 5303 | 0.5% |
| D | 4938 | 0.5% |
| R | 2099 | 0.2% |
| T | 744 | 0.1% |
| X | 21 | < 0.1% |
| Other values (12) | 46 | < 0.1% |